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(57) Abstract: Nucleic acid constructs containing HTV-1 GAG/POL and SIV gag or SIV env genes which have been mutated to 
remove or reduce inhibitory/instability sequences are disclosed. Viral particles and host cells containing these constructs and/or 
viral particles arc also disclosed. The exemplified constructs and viral panicles of the invention may be useful in gene therapy for 
numberous disorders, including HIV infection, or as a vaccine for HTV-i immunotherapy and immunoprophylaxis. 
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MOLECULAR CLONES WITH MUTATED 
HIV GAG/POL, SIV GAG AND SIV ENV GENES 

I. TECHNICAL FIELD 

The invention relates to nucleic acids comprising mutated HIV-1 
gag/pol and SIV gag gene sequences which are capable of being expressed 
independently of any SIV or HTV regulatory factors. The invention also relates to 
nucleic acids comprising a mutated SIV env gene sequence, which is capable of 
being expressed independently of any SIV or HIV regulatory factors. The preferred 
nucleic acids of the invention are capable of producing infectious viral particles. 

The invention also relates to vectors, vector systems and host cells 
comprising the mutated HIV-1 gag, HIV-1 pol and/or SIV gag gene sequences. The 
invention also relates host cells comprising these nucleic acids and/or vectors or 
vector systems. The invention also relates to the use of these nucleic acids, vectors, 
vector systems and/or host cells for use in gene therapy or as vaccines. 

H. BACKGROUND 

Until recently, gene therapy protocols have often relied on vectors 
derived from retroviruses, such as murine leukemia virus (MLV). These vectors are 
useful because the genes they transduce are integrated into the genome of the target 
cells, a desirable feature for long-term expression. However, these retroviral vectors 
can only transduce dividing cells, which limits their use for in vivo gene transfer in 
nonproliferating cells, such as hepatocytes, myofibers, hematopoietic stem cells, and 
neurons. 

Lentiviruses are a type of retrovirus that can infect both dividing and 
nondividing cells. They have proven extremely efficient at providing long-term 
gene expression (for up to 6 months) in a variety of nondividing cells (such as, 
neurons and macrophages) in animal models. See, e.g., Amado et al., Science 
285:674-676 (July 1999). It has been proposed that the optimal gene transfer 
system would include a vector based on HTV, or other lentivirus, that can integrate 
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into the genome of nonproliferating cells. Because retroviruses integrate in the 
genome of the target cells, repeated transduction is unnecessary. Therefore, in 
contrast to an adenoviral vector capable of in vivo gene delivery, problems linked to 
the humoral response to injected viral antigens can be avoided. See, e.g., Naldini et 
al., Science, 272:263-267 (1996), p. 263. 

HTV and other lentiviruses have a complex genome that, in addition 
to the essential structural genes (env, gag, and pol), contains regulatory (tat and rev) 
and accessory genes (vpr, vif vpu, and nef). HIV has evolved to efficiently infect 
and express its genes in human cells, and is able to infect nondividing cells such as 
macrophages because its preintegration complex can traverse the intact membrane 
of the nucleus in the target cell. This complex contains, in addition to the viral 
DNA, the enzyme integrase, the product of the vpr gene, and a protein encoded by 
the gag gene called matrix. The matrix protein enables the preintegration complex 
to pass into the nucleus to access the host DNA. Lentiviruses cannot efficiently 
transduce truly quiescent cells (cells in the Go state). However, unlike murine 
retroviral vectors, in addition to being able to infect dividing cells, HTV-based 
vectors can achieve effective and sustained transduction and expression of 
therapeutic genes in nondividing cells, such as hematopoietic stem cells and in 
terminally differentiated cells such as neurons, retinal photoreceptors, muscle, and 
liver cells. See, e.g., Amado et al. (July 1999) and Klimatcheva et al., Frontiers in 
Bioscience 4:d48 1-496 (June 1999), and the references cited therein. 

Although lentiviral vectors can be efficient gene delivery vehicles, 
there are safety concerns due to their origin. Therefore, the field has turned its 
attention to the development of vectors and production systems with built-in safety 
features to prevent the emergence of replication competent lentivirus (RCL). For 
example, in most laboratory applications, lentiviral vectors are generally created in a 
transient system in which a cell line is transfected with three separate constructs: a 
packaging construct, a transfer construct, and an envelope encoding construct. The 
packaging construct contains the elements necessary for vector packaging (except 
for env) and the enzymes required to generate vector particles. The transfer 
construct contains genetic cis-acting sequences necessary for the vector to infect the 
target cell and for transfer of the therapeutic (or reporter) gene. The lentivirus env 
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gene is generally deleted from the packaging construct and instead the envelope 
gene of a different virus is supplied in a third vector "the env-coding vector", 
although the Antiviruses em gene may be used if it is desired that the vector be 
intended to infect CD4 + T cells. A commonly used envelope gene is that encoding 
the G glycoprotein of the vesicular stomatitis virus (VSV-G), which can infect a 
wide variey of cells and in addition confers stability to the particle and permits the 
vector to be concentrated to high titers (see, e.g., Naldini et al., Science 272:263-267 
(1996) and Akkina et al. J. Virol. 70:2581 (1996). The use of three separate 
constructs and the absence of overlapping sequences between them minimizes the 
possibility of recombination during lentivirus (transfer) vector production. In 
addition, because no viral proteins are expressed by the lentiviral (transfer) vector 
itself, they do not trigger an effective immune response against cells expressing 
vector in animal models (a particular problem with vectors based on adenovirus). 
See, e.g., Amado et al., Science 285:674-676 (July 1999) and the references cited 
therein. See also Naldini et al. Science 272:263-267 (1996). 

The initial packaging plasmids contained most HIV genes except for 
env. In an effort to improve safety, subsequent HTV vectors have been produced in 
which the packaging plasmid is devoid of all accessory genes. This process does 
not interfere with efficient vector production and significantly increases the safety 
of the system because potential RCLs lack the accessory genes necessary for 
efficient replication of HIV in humans. Although these vectors can transduce 
growth-arrested cell lines and neurons in vivo, they have been reported to not 
efficiently transduce macrophages. The accessory gene vpr is believed to be 
necessary for HIV infection of these cells using these HTV vectors. See, Zufferey et 
al, Nature Biotechnol. 15:871-875 (1997). In contrast, as discussed later herein, 
the HIV-based lentiviral vectors of the present invention do not need any HIV 
accessory genes in order to be able to infect human macrophages and the other cells 
tested. 

The requirement of vpr or vi/for efficient transduction of liver cells 
has also been reported. See, e.g., Kafri et al., Nature Genet. 17:314 (1997). These 
results indicate that the requirement of accessory genes for efficient lentivirus- 
mediated gene transfer is dependent on the type of cell chosen as target, suggesting 
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that future applications of lentiviral vectors may involve vector constructs with 
different accessory genes, as needed. 

Zufferey et al., (1 997) describe an HIV vector system in which the 
virulence genes, env, vif, vpr, vpu, and we/have been deleted. This multiply 
attenuated vector conserved the ability to transduce growth-arTested cells and 
monocyte-derived macrophages in culture, and could efficiently deliver genes in 
vivo into adult neurons. The packaging plasmids described Zufferey et al. (1997) 
and Naldini et al. (1996) encode Rev and Tat, in addition to Gag and Pol. 

Lentiviral vectors engineered to become packaged into virions in the 
absence of the regulatory gene tat have also been described. See, e.g., Kim et al., J. 
Virol. 72:81 1-816 (1998) and Miyoshietal. J. Virol. 72:8150-8157 (1998). In 
these vectors the tat gene has been removed from the packaging plasmid. Kim et al. 
state that tat is not necessary as long as the serial 5' LTR promoter is replaced with 
a strong constitutive promoter. It also has other advantages for HIV therapy. 
Replacement of the HIV-1 LTR with a constitutive HCMV promoter permits the use 
of anti-Tat molecules such as Tat transdominant mutants or Tat activation response 
element decoys as therapeutic agents, since they will not affect vector production, 
(see p. 814, col. 2). The removal of the tat gene eliminates an essential virulence 
factor that could contribute to a possible RCL. Kim et al. (1998) describe a vector 
system which does not contain tat, vif, vpr, vpu and nef. The preferred vector 
system includes the rev gene which, the authors state "with RRE, is required for 
efficient RNA handling in this system." (p. 81 1, col. 2). However, Kim et al. also 
constructed Rev independent constructs using CTE. Kim et al. state that the 
rev/KRE components could be removed by using a sequence such as the Mason- 
Pfizer monkey virus (MPMV) constitutive transport element (CTE), thereby 
eliminating all accessory proteins, but this leads to a significant reduction in titer. 

Srinivasakumar et al., J. Virol. 71:5841-5848 (1997) describes the 
generation of stable HTV-I packaging lines that constitutively express high levels of 
HTV-1 structural proteins in either a Rev-dependent or a Rev-independent fashion. 
These cell lines were used to assess gene transfer by using a HTV-1 vector 
expressing the hygromycin B resistance gene and to study the effects of Rev, Tat, 
and Nef on the vector titer. The Rev-independent cell lines were created by using 
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gag-pol and env expression vectors that contain the MPMV CTE. This article 
describes the construction of four plasmids, among others: CMV gagpol-RRE and 
pCMVenv, which require Rev coexpression for HIV-1 structural gene expression, 
and pCMV gagpol-CTE and pCMVenv-CTE, which do not. To create Rev- 
containing and Rev-independent packaging, cell lines, CMT3 cells were transfected 
with vectors expressing Gag, Gag-Pol, and Env, using a calcium phosphate 
transfection procedure. 

By creating an HIV vector which contained the MPMV CTE 
(pTRl 67-CTE) and a packaging cell line which expressed the HIV structural 
proteins in a Rev-independent fashion, the authors were able to obtain a HTV vector 
system that functions completely without Rev. The titer of the vector obtained from 
this system was essentially the same as that obtained from a parallel system which 
contained Rev. The authors state that, in this context, the CTE seemed to substitute 
completely for Rev-RRE functions, similar to what was previously observed in 
transient-expression assays with Rev-dependent constructs. This is in contrast to 
situations where several rounds of HTV replication were measured. In those cases, 
titers from CTE-containing viruses were always reduced by at least 1 log unit 
compared to viruses utilizing Rev and the RRE. (See, Srinivasakumar et al, p. 
5847). 

The authors state that the advantages of having a HIV vector system 
that works in the absence of Rev opens the possibility of using it as a delivery 
vehicle for intracellular immunization against Rev function. Genes encoding Rev 
antagonists that have dramatic inhibitory effects on HTV replication, such as Rev 
M10 or RRE decoys, could be introduced into an HTV vector and put into cells 
normally infectable by HTV. Expression of the "anti-Rev" gene would be expected 
to dampen HTV infection. Any residual HTV replication should lead to activation of 
the vector LTR (by Tat) and create a vector-derived RNA that would be packaged 
by proteins derived from the infectious virus. In this scenario, the wild-type virus 
would act as a helper that may allow the spread of vector particles to previously 
nonimmunized cells. Because of the additional vector spread, it is likely that this 
type of scheme will be more effective in modulating HTV infection in vivo than one 
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based on traditional retrovirus vectors. The authors state that they are currently 
testing this approach in model systems. (See, Srinivasakumar et al., p. 5847). 

Another development in the quest for a safe system is the so-called 
self-inactivating (SIN) vector. See, e.g., Yu et al., Proc Natl Acad Sci USA 
83:3194-8 (1986) and Miyoshi et al., J, Virol. 72:8150 (1998). In Yu et al., a 
retrovirus-derived vector SIN vector was designed for the transduction of whole 
genes into mammalian cells. The SIN vector of Yu et al. contains a deletion of 299 
base pairs in the 3' long terminal repeat (LTR), which includes sequences encoding 
the enhancer and promoter functions. When viruses derived from such vectors were 
used to infect NIH 3T3 cells, the deletion was transferred to the 5' LTR, resulting in 
the transcriptional inactivation of the provirus in the infected cell. Introduction of a 
hybrid gene (human metallothionein-promoted c-fos) into cells via a SIN vector was 
not associated with rearrangements and led to the formation of an authentic mKNA 
transcript, which in some cases was induced by cadmium. The vector described in 
Miyoshi et al. also contains a deletion the 3' (downstream) LTR. A sequence within 
the upstream LTR serves as a promoter under which the viral genome is expressed. 
The deletion introduced in the downstream LTR is transferred to the upstream LTR 
during reverse transcription. This deletion inactivates the LTR promoter and 
eliminates the production of vector RNA. The gene (or genes) to be transferred 
(e.g., a reporter or therapeutic gene) is expressed from an exogenous viral or cellular 
promoter that is inserted into the lentivirus vector. An important safety feature of 
SIN vectors is that inactivation of the promoter activity of the LTR reduces the 
possibility of insertional mutagenesis (of the transfer vector) into the host genome. 
In addition, because the expression of the (transfer) vector RNA is eliminated, the 
potential for RCL production in the target cell is further minimized. SIN vectors 
should be particularly useful in gene transfer experiments designed to study the 
regulated expression of genes in mammalian cells. Absence of enhancer and 
promoter sequences in both LTRs of the integrated provirus should also minimize 
the possibility of activating cellular oncogenes and may provide a safer alternative 
to be used in human gene therapy. Other modifications to enhance safety and 
specificity include the use of specific internal promoters that regulate gene 
expression, either temporally or with tissue or cell specificity. 
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Other strategies to improve safety in human studies would be to use 
nonhuman lentiviruses such as simian immunodeficiency virus, bovine 
immunodeficiency virus, or equine infectious anemia virus. Of these, vectors 
derived from the feline immunodeficiency virus have been engineered to efficiently 
transduce nondividing human cells. See, e.g., Poeschla et al., Nature Med. 4:354- 
357 (1998) and WO 99/15641. In addition, White et al., J. Virol. 73:2832-2840 
(April 1999) described lentiviral vectors using human and simian immunodeficient 
virus elements in attempt to improve safety by reducing the likelihood of 
recombination between packaging constructs and transfer constructs. 

The development of efficient packaging lines has proven challenging 
because expression of the VSV-G envelope and a number of HTV proteins is toxic to 
cells. Recently, a producer line has been designed in which the expression of 
packaging genes and VS V-G, and therefore the production of vector, can be turned 
on at will. Kafri et al., J. Virol. 73-576-584 (1999). The cell line can be expanded 
for scale-up vector production when the expression of toxic genes is turned off. 
This cell line produces high titer vector without generating RCL. Hematopoietic 
stem cells transduced with an HIV vector were transplanted into rhesus macaques as 
described by Donahue et al. Blood 92 (suppl. 1), abstract 4648.5 (1998) with at least 
a 14-month follow-up. At that time the procedure proved to be safe; all animals in 
the study have remained healthy without evidence of circulating HIV or vector. 
See, Amado et al., Science 285:674-676 (July 1999). 

Many gene therapy protocols have been designed to correct a number 
of inherited metabolic, infectious, or malignant diseases using the hematopoietic 
stem cell. This cell has the capacity to self-renew and to differentiate into all of the 
mature cells of the blood and immune systems. Many diseases that affect these 
systems could potentially be treated by the stable introduction of therapeutic genes 
into stem cells. Recently, lentiviral vectors were shown to bypass the need for ex 
vivo stem cell stimulation (which is necessary when using murine retroviral 
vectors), by mediating efficient gene transfer into very primitive human stem cells 
that contributed to stable, long-term reconstitution of SCID mouse bone marrow 
with many hematopoietic lineages. See, e.g., Miyoshi et al., Science 283:682 
(1999). Similarly, in a rhesus macaque model of autologous transplantation with 
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lenti virus-transduced stem cells, multilineage gene expression was found, 
suggesting transduction of an early blood cell progenitor under conditions of 
minimal stem cell stimulation, ordinarily insufficient for transduction with murine 
retroviruses. See, Donahue et al., Blood 92 (suppl. 1), abstract 4648.5 (1999) and 
Amado et al., Science 285:674-676 (July 1999). 

In HTV infection, another advantage of lentiviral vectors designed 
against HIV is their potential to be mobilized by HIV in the infected patient, 
because the virus supplies all of the necessary elements for packaging of the vector. 
If these mobilized vectors contained the HTV envelope, they could efficiently 
transfer their genes (for example, genes custom-designed to confer resistance 
against HIV) into CD4 + T cells, protecting them from subsequent HIV infection. 
Lentiviral vectors can also be designed to efficiently express their genes only in 
CD4 + T cells that are infected with HIV (so called taf-inducible vectors). In these 
vectors, all HIV genes, including tat and rev, are ablated; cis-acting sequences 
required for integration, expression, and packaging are retained, and expression is 
dependent on the activity of the HIV LTR (which requires transactivation by Tat). 
It has been shown that in this system, vector expression is induced efficiently upon 
HTV infection. Moreover, in the absence of genes that confer resistance against 
HTV, stable integration of this vector in permissive cell lines resulted in inhibition of 
HTV replication. Although the mechanism of HTV inhibition has not been 
completely elucidated, preliminary results suggest that this vector competes with 
HTV at the level of reverse transcription. See, An et al. 3 J. Virol., in press, and 
Amado et al., Science 285:674-676 (1999). 

A number of other potential medical applications, where the 
modification of the genetic material of quiescent cells could result in the prevention 
or reversal of a disease process, are beginning to be explored. For example, the 
finding that lentiviral vectors can mediate stable and long-term gene transfer by 
direct injection of vector into the rat and mouse retina has lent support to the notion 
of gene therapy for the treatment of retinitis pigmentosa. This degenerative disease 
of the retina is characterized by photoreceptor cell death, resulting in a slow 
progression to blindness. Mutations in the cGMP phosphodiesterase JJ subunit 
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(PDEP) gene of rod photoreceptors lead to an autosomal recessive form of retinitis 
pigmentosa in humans, and in the rd mouse model of the disease. Previous studies 
have shown that adenovirus and adeno-associated virus-mediated PDEP subretinal 
gene transfer results in a delay in photoreceptor cell death. Using the rd mouse 
model, a recent study demonstrated that photoreceptors could be rescued in up to 
50% of eyes injected with a lentivirus vector containing the murine PDEP gene. In 
contrast with the short-term expression previously obtained with adenovirus vectors, 
PDEP expression in this study persisted for at least 24 weeks. This finding points to 
the potential success of gene therapy in a disease that currently lacks effective 
treatment. See, Takahashi et al., J. Virol, 73:7812-7816 (Sept. 1999) and Amado et 
al. Science, 285:674-676 (1999). 

In nature, the expression of gag, pol, and em of HTV-1 depends on 
the presence of the viral Rev protein. This dependence is, at least in part, due to the 
presence of negatively acting sequences (inhibitory or instability elements [INS]) 
located within unspliced and partially spliced mRNAs. The positive interaction of 
Rev with the Rev-responsive element [RRE] in these mRNAs counteracts the 
negative effects of the inhibitory sequences. 

None of the above references teach or suggest that the gag and/or pol 
genes described therein may be replaced with the gag and/or pol genes in which the 
inhibitory/instability have been mutated to render their expression Rev-idependent. 
Furthermore, there is no disclosure of the specific HTV-1 gaglpol or SIV gag 
mutated genes described herein. 

The gag/pol clone of the invention was made using the method for 
eliminating inhibitory/instability regions from a gene as first described in U.S. 
patent application Serial No. 07/858,747, filed March 27, 1992, (inventors, G. 
Pavlakis and B. Felber) entitled "Method of Eliminating Inhibitory/Instability 
Regions from mRNA" and later described in a Continuation-in-Part ("CDP") 
application, filed as PCT application PCT/US93/02908 on March 29, 1993 and U.S. 
Patent Nos. 5,972,596 and 5,965,726. The disclosure of the CIP application was 
published as International Publication No. WO 93/20212 on October 14, 1993. 
(The disclosures of these patents and patent applications are specifically 
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incorporated by reference herein in their entirety.) The method was also described 
in Schwartz et ah, J. Virol. 66:7176-7182 (1992). 

Schneider et al., J. ViroL 71:4892-4903 (1997), extend the work 
described in the patent applications and in Schwartz et al. by identifying and 
characterizing additional INS within gag, protease and pol genes and mutating them 
in a similar manner. Schneider et al. disclose nucleic acid constructs which contain 
completely mutated HIV-1 gag genes, but only partially mutated HTV-1 pol genes. 

Schneider et al. demonstrate that expression vectors containing an 
intact or nearly intact p55 w region allow the production of immature viral particles 
in mammalian cells in the absence of any other HIV proteins. The introduction of 
additional mutations in the protease region allowed efficient production of 
Gag/protease, which resulted in processing of the Pr55*° g precursor and production 
of mature Gag particles with a lentivirus-like conical-core structure. 

Schneider et al. disclose that Rev-independent expression vectors 
allow the efficient expression of Gag proteins in many cell lines that are not able to 
support efficient Rev-RRE-dependent rescue of these RNAs. Schneider et al. also 
disclose that gag/pol expression vectors may be important for vaccination 
approaches against HIV-1, since the gag/pol region is more conserved than is the 
env region and may be important for an effective immune response against HIV and 
for protection against infection. They also state that efficient HIV gene expression 
in many cells is also of interest for possible gene transfer experiments using 
lentiviral vectors in nondividing or slowly dividing cells, since HIV and the other 
lentiviruses are able to infect quiescent cells. 

Pavlakis et al., Natl Conf Hum Retroviruses Relat Infect (2nd). 
(1995), 91 , state that Rev-independent Gag expression vectors were able to produce 
viral particles in human and mouse cells in the absence of any other HIV proteins, 
and that additional mutations in the pol region allowed the expression of the 
protease and the processing of the p55 gag precursor. Direct DNA injection of TAT 
and Rev independent Gag expression vectors in mouse muscle resulted in Gag 
expression detected by ELIS A and in anti-gag antibody response. Several Rev-and 
Tat- independent Gag expression cassettes were inserted into retroviral vectors and 
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cell lines expressing Gag or Gag fragments that are dominant negative inhibitors of 
HIV-1 were constructed. 

Shiver et al. (1996) describe the results of DNA vaccination of mice 
and non-human primates with mutated plasmid DNA encoding either mutated genes 
encoding HIV-1 gag (p55 gag) or env (gpl20 or gpl60). Both gag and env vaccine 
recipients exhibited antigen-specific cytotoxic and helper T lymphocyte (CTL, Th) 
responses. The results are stated to demonstrate that DNA vaccines elicited long- 
lived T cell responses in both mice and nonhuman primates that were disseminated 
throughout the lymphatics. 

III. SUMMARY OF THE INVENTION 

The invention relates to nucleic acids comprising the nucleic acid 
sequence of the mutated HIV-1 gag/pol gene shown in Figure 1 (SEQUENCE ID 
NO:l) and vectors and vector systems comprising these nucleic acids. 

The invention also relates to nucleic acids comprising the nucleic 
acid sequence of the mutated SIV gag gene shown in Figure 3 and vectors and 
vector systems comprising these nucleic acids. 

The invention also relates to nucleic acids comprising the mutated 
SIV env gene shown in Figure 17 and vectors and vector systems comprising these 

nucleic acids. ' 

The invention also relates to products produced by the nucleic acids, 

e.g., niRNA, protein, and infectious viral particles. 

The invention also relates to compositions comprising these nucleic 

acids and/or their expression products. 

The invention also relates to host cells comprising these nucleic 

acids, vector systems or viral particles. 

The invention also relates to uses of these nucleic acids, vector 
systems, host cells, expression products, and/or compositions to produce mRNA, 
proteins, and/or infectious viral particles, and/or to induce antibodies and/or 
cytotoxic or helper T lymphocytes. 

The invention also relates to the use of these nucleic acid constructs, 
vectors, vector systems and or host cells for use in immunotherapy and 
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immunoprophylaxis, e.g., as a vaccine, or in genetic therapy after expression, 
preferably in humans. The nucleic acid constructs of the invention can include or be 
incorporated into lentiviral vectors or other expression vectors or they may also be 
directly injected into tissue cells resulting in efficient expression of the encoded 
protein or protein fragment. These constructs may also be used for in-vivo or in- 
vitro gene replacement, e.g., by homologous recombination with a target gene in- 
situ. 

IV. BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1. DNA sequence of a mutated HTV-1 gag/pol molecular clone 
(SEQUENCE ID NO:l).' The gagpol terminator is located at positions 4305-4397 
ofSEQUENCEIDNO:l. 

Fig. 2. Comparison of the sequence of the wild -type and mutated 
pol region in pCMVgagpolBNkan. Position #1 in the figure is position 2641 in 
plasmid pCMVgagpolBNkan. The comparison starts at position 1 872 from the gag 
initiator ATG. 

Fig. 3. DNA sequence of a mutated SIV gag molecular clone 

(SIVgagDX). 

Fig. 4. Comparison of the mutated SIV gag DNA sequence in 
SIVgagDX with the wild type SIV sequence from Simian (macaque) 
immunodeficiency virus isolate 239, clone lambda siv 239-1 (GenBank accession 
No. M33262). 

Fig. 5. Schematic diagram of some components of sample versions 
of a lentiviral system. BGH poly (A): bovine growth hormone poly (A) signal; 
MSD: mutated splice donor site; \|/: encapsidation signal; SD, splice donor site; 
SA, splice acceptor site; U X" indicates that the ATG codon of the partial gag gene 
sequence is mutated so that translation of this gene does not occur. 

Fig. 6. Schematic diagram of the packaging construct 
pCMVgagpolBNkan. 

Fig. 7. Schematic diagram of transfer construct 1: pmBCwCNluci. 
The packaging signal, the CMV promoter and the coding region for the luciferase 
gene are flanked by the 5' and 3 HTV-1 LTRs, which provide promoter and 
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polyadenylation signals, as indicated by the arrows. Three consecutive arrows 
indicate the U5, R, and U3 regions of the LTR, respectively. The transcribed 
portions of the LTRs are shown in black. Some restriction endonuclease cleavage 
sites are also indicated. 

Fig. 8. Schematic diagram of transfer construct 1: pmBCmCNluci. 
Symbols are as above. 

Fig. 9. DNA sequence of packaging construct pCMVgagpolBNkan. 

Fig. 10. DNA sequence of transfer construct 1 : pmBCwCNluci. 

Fig. 11. DNA sequence of transfer construct 1: pmBCmCNluci. 

Figure 12: 

Fig. 12. Nucleotide sequence of the region BssHII (71 1) to Clal 
(830) in wild-type HTV-1 molecular clones HXB2 and NL4-3, and in the transfer 
constructs. The translation initiator signal for Gag protein (ATG) is underlined. 
pmBCwCNluci and pmBCmCNluci (transfer constructs 1 and 2) contain the 
sequence mBCwCN. Transfer construct 3 contains the sequence m2BCwCN. In 
contrast to the sequence mBCwCN, m2BCwCN has different mutations at the 5' 
splice site region and has an intact Gag ATG. 

Fig. 13. Bar graph showing levels of gag protein that is released 
from cells upon transient transfection with pCMVgagpolBNkan (labeled 
pCMVBNKan in the figure). 

Fig. 14. Bar graph showing reverse transcriptase activity from the 
Rev-independent gag-pol HTV-1 vector pCMVgagpolBNkan (labeled pCMVBNKan 
in the figure). 

Fig. 1 5 . Bar graphs showing the amount of luciferase per nanogram 
of p24 Gag protein detected in cells transducted with PCMVgagpolBNkan Rev- 
independent gag-HIV- 1 based retroviral vectors. The results show that with 
PCMVgagpolBNkan Rev-independent gag-HTV-1 based retroviral vectors display 
high transduction efficiency in (A) 293 cells, (B) human lymphoid cells, (C) human 
myeloid cells (U937), as well as (D) non-dividing cells such as primary human 
macrophages. 

Fig. 16. Schematic diagram of the SIV envelope encoding vector 
CMVkan/R-R-SrVenvCTE. 
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Fig. 17. DNA sequence of the SIV envelope encoding vector 
CMVkan/R-R-SIVenvCTE containing a mutated SIV env gene. 

V. MODES FOR CARRYING OUT THE INVENTION 

It is to be understood that both the foregoing general description and 
the following detailed description are exemplary and explanatory only, and are not 
restrictive of the invention, as claimed. The accompanying drawings, which are 
incorporated in and constitute a part of the specification, illustrate an embodiment of 
the invention and, together with the description, serve to explain the principles of 
the invention. 

One aspect of the invention comprises vectors that encode the Gag 
and/or Pol of HTV-1 in a Rev-independent manner. An example of such a vector 
which is described herein is the plasmid pCMVgagpolBNkan, which encodes the 
complete Gag and Pol of HTV-1 in a Rev-independent manner, and also contains a 
gene conferring kanamycin resistance. This plasmid is Tat and Rev-independent 
and was generated by eliminating the inhibitory/instability sequences present in the 
gag/pol mRNA without altering the amino acid sequence of the proteins coded by 
the genes. 

The gag/pol clone of the invention is a DNA construct of the gag/pol 
region of HIV which has had the inhibitory/instability regions removed. The 
construct is expected to be useful as a component a new type of lentivirus vector for 
use in gene therapy or as a vaccine. 

The gag, pol or gag/pol sequences of the invention can be highly 
expressed in human and other mammalian cells in the absence of any other 
regulatory and structural protein of HIV, including Rev. When the gag/pol 
sequences are combined with a sequence encoding an envelope protein, such as the 
VSV G protein or the HIV envelope protein (e.g., in the same vector or in another 
expression vector), infectious virus is produced after transfection into human cells. 
When a gene encoding a non-HIV envelope protein is used, for example, in the 
presence of the HIV gag/pol gene, the virus particles produced would contains only 
the HIV proteins Gag and Pol. 
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Lenti viral vectors or vector systems based on the gag y pol or gag/pol 
sequences of this invention, as exemplified by the Rev-independent pCMVgagpol 
BNkan construct described herein, may be used for gene therapy in vivo (e.g., 
parenteral inoculation of high titer vector) or ex vivo (e.g., in vitro transduction of 
patient's cells followed by reinfusion into the patient of the transduced cells). These 
procedures are been already used in different approved gene therapy protocols. 

The HIV gag/pol clone and SIV gag clone of the invention were 
made using the method for eliminating inhibitory/instability regions from a gene as 
described in U.S. patent application Serial No. 07/858,747, filed March 27, 1992, 
and also in related U.S. Patent Nos. 5,972,596 and 5,965,726, which are 
incorporated by reference herein. This method does not require the identification of 
the exact location or knowledge of the mechanism of function of the INS. 
Generally, the mutations are such that the amino acid sequence encoded by the 
mRNA is unchanged, although conservative and non-conservative amino acid 
substitutions are also envisioned where the protein encoded by the mutated gene is 
substantially similar to the protein encoded by the non-mutated gene: The mutated 
genes can be synthetic (e.g., synthesized by chemical synthesis), semi-synthetic 
(e.g., a combination of genomic DNA, cDNA, or PCR amplified DNA and synthetic 
DNA), or recombinantly produced. The genes also may optionally not contain 
introns. The nucleic acids of the invention may also contain Rev-independent 
fragments of these genes which retain the desired function (e.g., for antigenicity of 
Gag or Pol, particle formation (Gag) or enzymatic activity (Pol)), or they may also 
contain Rev-independent variants which have been mutated so that the encoded 
protein loses a function that is unwanted in certain circumstances. In the latter case, 
for example, the gene may be modified to encode mutations (at the amino acid 
level) in the active site of reverse transcriptase or integrase proteins to prevent 
reverse transcription or integration. Rev-independent fragments of the gag gene are 
described in U.S. patent application Serial No. 07/858,747, filed March 27, 1992, 
and also in related U.S. Patent Nos. 5,972,596 and 5,965,726, which are 
incorporated by reference herein. 

In addition to being capable of producing HTV Gag and Pol proteins 
in the absence of Rev regulatory protein in a cell in vivo, the HIV gag/pol clone and 
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SIV gag clone of the invention are also capable of producing HIV Gag and Pol 
proteins in the absence of any added cis acting transport element, such as CTE or 
CTE-like elements (collectively refered herein as RNA Transport Elements (RTE)). 
Experiments indicate that the mutated vectors of the invention for SIV gag are far 
superior to those adding CTE (see Qiu et al, J Virol. 73:9145-52 (1999)). 

The expression of the proteins encoded by these vectors after 
transfection into human cells may be monitored at both the level of RNA and 
protein production. RNA levels are quantitated by methods known in the art, e.g., 
Northern blots, SI mapping or PCR methods. Protein levels may also be 
quantitated by methods known in the art, e.g., western blot or ELISA or fluorescent 
detection methods. A fast non-radioactive ELISA protocol can be used to detect 
gag protein (DUPONT or COULTER gag antigen capture assay). 

At least three types of lentiviral vectors based on the gag/pol genes 
of the invention for use in gene therapy and/or as a vaccine are envisioned, i.e., 
lentiviral vectors having 

a) no round of replication (i.e., a zero replication system) 

b) one round of replication 

c) a fully replicating system 

For a system with no round of replication, a gag/pol gene, or separate 
gag and pol genes, or fragments of these genes, expressed using appropriate 
transcription units, e.g., a CMV promoter and a BGH poly (A) site. This will allow 
expression of the gag/pol unit (or gag or pol or fragments) thereof) for vaccine 
purposes. This expression can be accomplished without the production of any 
functional retroviral enzymes, provided that the appropriate mutation(s), e.g., a 
missense mutation, are introduced. In a zero replication system, a virus stock will 
be administered to the cells or animals of interest. For example, if one creates and 
uses a virus stock with the exemplified system using the packaging vector 
PCMVgagpolBNKan, the transfer construct pmBCwCNluci or pmBCmCNluci, and 
the envelope containing vector pHCMV-G, one obtains a zero replication system. 
The virus particles produced by such system can infect cells, and the reverse 
transcribed transfer construct DNA will go into the nucleus but, because the coding 
regions for viral structural proteins are not present, there will be no virus expression 



WO 01/46408 



PCT/US00/34985 



-17- 

and replication (0 rounds). If one transfects cells in vivo with the same 3 DNAs, 
they will go to the nucleus, express viral proteins, make infectious virus particles 
and go out and infect another cell or cells (1 round). Since in vivo delivery of three 
plasmids may result in lower expression, at least two different embodiments are 
envisioned. In the first, two plasmids may be used, e.g., MV1 shown in Fig. 5 and 
an envelope expression plasmid such as pHCMV-G. Other plasmids encoding 
functional envelopes from HIV, SIV, or other retroviruses can also be used. 
Transfection by the two plasmids results in infectious virus that can infect and 
integrate into new cells (1 round). The infected cells produce gagpol but virus 
propagation is not possible in the absence of env. . 

For a system with one round of replication, at least two additional 
embodiments are envisioned. In the first method, a combination of the genes, e.g., a 
gag/pol gene, an env encoding gene and, preferably, a gene encoding a reporter 
protein or other polynucleotide or protein of interest, are delivered into the cells of 
interest in vivo. As discussed above for the exemplified system, if one transfects 
cells in vivo with the same 3 DNAs, they will go to the nucleus, express viral 
proteins, make infectious virus particles, be released and infect another cell or cells 
(1 round). 

In another embodiment, the same result (i.e., only one round of 
replication) can be obtained by using transfer vectors that have deletions in the 3' 
LTR and in which a heterologous-promoter (e.g., the CMV-promoter, or inducible 
promoter, or tissue-specific promoter), is used in place of the '3' LTR promoter. 
The mutations in the 3' LTR making it inactive upon reverse transcription and 
integration. This is because the integrated provirus derives both its 5' LTR and its 3' 
LTR from the 3' LTR of the starting (transfer) construct. (This is a well-known 
property of all retroviruses and has been used to make self-inactivating vectors 
(SIN)). There are several reasons one may want to inactivate the incoming LTR 
promoter, one of which is to use a different tissue specific or regulated promoter for 
expression of a gene of interest in the integrated provirus. Note that, with SIN 
vectors, if one uses a viral stock made in vitro after transfection into cells and 
collection of infectious virus, there will be no round of replication. If one transfects 
cells with the DNAs in vivo, there will be one round of replication. If functional 
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gag, pol 9 or env are not included in the DNA mix, there will not be any infection at 
all (i.e., infectious viruses will not be made). 

A fully replicating Rev-independent system has not been constructed 
yet, although it is expected that a functional system can be constructed using Rev- 
independent gag/pol and env sequences. If desired, extra posttranscriptional control 
elements such as the CTE element, which can replace Rev and give infectious virus 
(see e.g., Zolotukhin et al., J.Virol.68:944-7952 (1994)) are included. The fully 
replicating system should be in one piece, containing the LTR, packaging signal, 
gag/pol, splice site, env, tat, one or more CTE or CTE-like elements (if desired for 
optimal results), and LTR. Tat is thought to be required in this construct, at least in 
non-permissive cells. Such a system is depicted in Figure 5, (construct MV2). In 
this system, a cell or animal of interest (preferably human) would be infected with 
virus stock that then propagates. CTE or CTE-like elements (depicted in construct 
MV2 as RTE (RNA Transport Elements)) are desirable since they have been shown 
to improve expression, and since many retroviruses require the presence of 
posttranscriptional control elements. There are several types of CTE and CTE-like 
elements, and these elements appear to work via a different pathway from the Rev- 
RRE pathway. See, e.g., Tabernero et al., J Virol. 71:95-101 (1997). See also, 
Pavlakis and Nappi, PCT/US99/1 1082, filed May 22, 1999, published as WO 
99/61596 on December 2, 1999 (and incorporated herein by reference), which 
describes a new type of post-transcriptional control element that is able to replace 
CTE and HTV RRE/Rev. The Pavlakis-Nappi element does not work in the same 
way as CTE and does not have any sequence or structure homology. 

In a preferred embodiment, a lentiviral system of the invention 
comprises the following three components: 

1 . a packaging vector containing nucleic acid sequences 

encoding the elements necessary for vector packaging such as 
structural proteins (except for HIV env) and the enzymes 
required to generate vector particles, the packaging vector 
comprising at least a mutated HTV or SIV gag/pol gene of the 
invention; 
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2. a transfer vector containing genetic cis-acting sequences 
necessary for the vector to infect the target cell and for 
transfer of the therapeutic or reporter or other gene(s) of 
interest, the transfer vector comprising the encapsidation 
signal and the gene(s) of interest or a cloning site for inserting 
the gene(s) of interest; 

and 

3 . a vector containing sequences encoding an element necessary 
for targeting the viral particle to the intended recipient cell, 
preferably the gene encoding the G glycoprotein of the 
vesicular stomatis virus (VSV-G) or amphotrophic MuLV or 
lentiviral envs. 

Using the CMV promoter or other strong, high efficiency, promoter 
instead of the HIV-1 LTR promoter in the packaging vector, high expression of gag, 
pol, or gag/pol can be achieved in the total absence of any other viral protein. The 
exchange of the HIV-1 LTR promoter with other promoters is beneficial in the 
packaging vector or other vectors if constitutive expression is desirable and also for 
expression in other mammalian cells, such as mouse cells, in which the HIV-1 
promoter is weak. Vectors containing the sequences of the invention can be used 
for the Rev independent production of HIV-1 Gag/Pol, HIV-1 Gag, HIV-1 Pol, and 
SIV Gag proteins. In certain embodiments, the presence of heterologous promoters 
will also be desired in the transfer vector and the envelope encoding vector, when 
such vectors are used. 

The gene(s) of interest are chosen according to the effect sought to be 
achieved. For gene therapy purposes there will be at least one therapeutic gene 
encoding a gene product which is active against the condition it is desired to treat or 
prevent. Alternatively or additionally, there may be a gene which acts as a marker 
by encoding a detectable product. Therapeutic genes may encode, for example, an 
anti-sense RNA, a ribozyme, a transdominant negative mutant of a target protein, a 
toxin, a conditional toxin, an antigen that induces antibodies or helper T-cells or 
cytotoxic T-cells, a single chain antibody or a tumor suppresser protein. See, e.g., 
WO 98/17816. 
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An even more extensive list of genes of interest for use in lentiviral 
vectors is described, e.g., in WO 99/04026 on page 10, line 20 to page 12, line 7. 
Table 2 of Klimatcheva et al. (1999) also provides a list of disorders and target cells 
for gene therapy, as well as a number of lentiviral vectors used by others. This list 
includes genetic/metabolic deficiencies, viral infection and cancer. Inherited 
genetic defects such as adenosine deaminase deficiency, familial 
hypercholesterolemia, cystic fibrosis, mucopolysaccharidosis type VII, types I and 
II diabetes, classical phenylketonuria and Gaucher disease are diseases which are 
listed as being possible to overcome by lentiviral vector-mediated gene therapy 
because they constitute single-gene deficiencies for which the involved genes are 
known. Viral diseases are also listed as constituting appropriate targets for lentiviral 
gene delivery. In particular, a number of gene therapy approaches have been 
proposed for the treatment of HIV infection and, for some of these strategies, phase 
I studies have recently begun in humans. The article states that preliminary studies 
have dealt with defective murine oncoviruses for delivery of anti-sense RNAs, 
ribozymes and trans-dominant proteins against HIV replication. 

In any of the vectors, but preferably in the transfer vector, an inserted 
gene could have an internal ribosomal entry site (IRES), e.g., from picornaviral 
RNA. An IRES will be used in circumstances that one wants to express two 
proteins from the same promoter. For example one protein of interest and a marker 
gene, e.g., green fluorescent protein (GFP) or a marker gene and a drug resistance 
gene (e.g. the firefly luciferase gene and neomycin phosphotransferase gene) as 
described on p. 58 of WO 99/04026, for example. Using an IRES the expression of 
the two proteins is coordinated. A further gene or genes may also be present under 
the control of a separate promoter. Such a gene may encode for example a 
selectable marker, or a further therapeutic agent which may be among the 
therapeutic agents listed above. Expression of this gene may be constitutive; in the 
case of a selectable marker this may be useful for selecting successfully transfected 
packaging cells, or packaging cells which are producing particularly high titers of 
the retroviral vector particles. Alternatively or additionally, the selectable marker 
may be useful for selecting cells which have been successfully infected with the 
lentiviral vector and have the provirus integrated into their own genome. 
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One way of performing gene therapy is to extract cells from a 
patient, infect the extracted cells with a lenti viral vector and reintroduce the cells 
back into the patient A selectable marker may be used to provide a means for 
enriching for infected or transduced cells or positively selecting for only those cells 
which have been infected or transduced, before reintroducing the cells into the 
patient. This procedure may increase the chances of success of the therapy. 
Selectable markers may be for instance drug resistance genes, metabolic enzyme 
genes, or any other selectable markers known in the art. Typical selection genes 
encode proteins that confer resistance to antibiotics and other toxic substances, e.g., 
histidinol, puromycin, hygromycin, neomycin, methotrexate etc. and cell surface 
markers. 

However, it will be evident that for many gene therapy applications 
of lenti viral vectors, selection for expression of a marker gene may not be possible 
or necessary. Indeed expression of a selection marker, while convenient for in vitro 
studies, could be deleterious in vivo because of the inappropriate induction of 
cytotoxic T lymphocytes (CTLs) directed against the foreign marker protein. Also, 
it is possible that for in vivo applications, vectors without any internal promoters 
will be preferable. The presence of internal promoters can affect for example the 
transduction titres obtainable from a packaging cell line and the stability of the 
integrated vector. Thus, single transcription unit vectors, which may be bi-cistronic 
or poly-cistronic, coding for one or two or more therapeutic genes, may be the 
preferred vector designed for use in vivo. See, e.g., WO 98/17816. 

Suitable host or producer cells for use in the invention are well 
known in the art. May lentiviruses have already been split into replication defective 
genomes and packaging components. For those which have not the technology is 
available for doing so. The producer cell encodes the viral components not encoded 
by the vector genome such as the Gag, Pol and Env proteins. The gag,pol and env 
genes may be introduced into the producer cell transiently, or may be stably 
integrated into the cell genome to give a packaging cell line. The lentiviral vector 
genome is then introduced into the packaging cell line by transfection or 
transduction to create a stable cell line that has all of the DNA sequences required to 
produce a lentiviral vector particle. Another approach is to introduce the different 
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DNA sequences that are required to produce lentiviral vector particle, e.g., the env 
coding constrict, the gag-pol coding construct and the transfer construct into the cell 
simultaneously by transient triple transfection. 

Target cells identified by Klimatcheva et al. (1 999), and the 
references cited therein, include airway epithelial cells for cystic fibrosis; retinal 
photoreceptor cells for retinitis pigmentosa; progenitors for red blood cells, 
macrophages, and lymphocytes for hematopoietic disorders, sickle cell anemia, fl- 
thalassemia, lysosomal storage disorders, mucopolysaccharidoses, and severe 
combined immunodeficiency syndrome; bone marrow cells and macrophages for 
Gaucher's disease; liver cells for familial hypercholesterolaemia; T-Iymphocytes 
and macrophages for HIV infection; brain tissue, neurons, and glial cells for 
neurodegenerative diseases such as Parkinson's and Alzheimer's diseases; 
endothelial cells and cardiac myocytes for cardiovascular diseases; and cancer cells 
in various tissues (e.g. liver or brain) for cancer. Target cells for other diseases 
would be apparent to one of skill in the art. 

Vaccines and pharmaceutical compositions comprising at least one of 
the nucleic acid sequences, vectors, vector systems, or transduced or transfected 
host cells of the invention and a physiologically acceptable carrier are also part of 
the invention. 

As used herein, the term "transduction" generally refers to the 
transfer of genetic material into the host via infection, e.g., in this case by the 
lentiviral vector. The term "transfection" generally refers to the transfer of isolated 
genetic material into cells via the use of specific transfection agents (e.g., calcium 
phosphate, DEAE Dextran, lipid formulations, gold particles, and other 
microparticles) that cross the cytoplasmic membrane and deliver some of the 
genetic material into the cell nucleus. 

Systems similar to those described herein can be produced using 
elements of lentiviruses in addition to the HTV and/or SIV genes described herein. 

Pharmaceutical Compositions 

The pharmaceutical compositions of the invention contain a 
pharmaceutically and/or therapeutically effective amount of at least one nucleic acid 
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construct, vector, vector system, viral particle/virus stock, or host cell (i.e., agents) 
of the invention. In one embodiment of the invention, the effective amount of an 
agent of the invention per unit dose is an amount sufficient to cause the detectable 
expression of the gene of interest. In another embodiment of the invention, the 
effective amount of agent per unit dose is an amount sufficient to prevent, treat or 
protect against deleterious effects (including severity, duration, or extent of 
symptoms) of the condition being treated. The effective amount of agent per unit 
dose depends, among other things, on the species of mammal inoculated, the body 
weight of the mammal and the chosen inoculation regimen, as is well known in the 
art. The dosage of the therapeutic agents which will be most suitable for 
prophylaxis or treatment will also vary with the form of administration, the 
particular agent chosen and the physiological characteristics of the particular patient 
under treatment. The dose is administered at least once. Subsequent doses may be 
administered as indicated. 

To monitor the response of individuals administered the 
compositions of the invention, mRNA or protein expression levels may be 
determined. In many instances it will be sufficient to assess the expression level in 
serum or plasma obtained from such an individual. Decisions as to whether to 
administer another dose or to change the amount of the composition administered to 
the individual may be at least partially based on the expression levels. 

The term "unit dose" as it pertains to the inocula refers to physically 
discrete units suitable as unitary dosages for mammals, each unit containing a 
predetermined quantity of active material (e.g., nucleic acid, virus stock or host cell) 
calculated to produce the desired effect in association with the required diluent The 
titers of the virus stocks to be administered to a cell or animal will depend on the 
application and on type of delivery (e.g., in vivo or ex vivo). The virus stocks can be 
concentrated using methods such as centrifugation. The titers to be administered ex 
vivo are preferably in the range of 0.001 to 1 infectious unit /cell. Another method 
of generating viral stocks is to cocultivate stable cell lines expressing the virus with 
the target cells. This method has been used to achieve better results when using 
traditional retroviral vectors because the cells can be infected over a longer period 
of time and they have the chance to be infected with multiple copies of the vector. 
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For in vivo administration of nucleic acid constructs, vectors, vector 
systems, virus stocks, or cells which have been transduced or transfected ex vivo, the 
dose is to be determined by dose escalation, with the upper dose being limited by 
the onset of unacceptable adverse effects. Preliminary starting doses may be 
extrapolated from experiments using lentiviral vectors in animal models, by 
methods known in the art, or may be extrapolated from comparisons with known 
retroviral (e.g., adenoviral) doses. Generally, small dosages will be used initially 
and, if necessary, will be increased by small increments until the optimum effect 
under the circumstances is reached. Exemplary dosages are within the range of 10 
up to approximately 5 x 10 15 particles. 

Inocula are typically prepared as a solution in a physiologically 
acceptable carrier such as saline, phosphate-buffered saline and the like to form an 
aqueous pharmaceutical composition. 

The agents of the invention are generally administered with a 
physiologically acceptable carrier or vehicle therefor. A physiologically acceptable 
carrier is one that does not cause an adverse physical reaction upon administration 
and one in which the nucleic acids are sufficiently soluble to retain their activity to 
deliver a pharmaceutically or therapeutically effective amount of the compound. 
The pharmaceutically or therapeutically effective amount and method of 
administration of an agent of the invention may vary based on the individual patient, 
the indication being treated and other criteria evident to one of ordinary skill in the 
art. A therapeutically effective amount of a nucleic acid of the invention is one 
sufficient to prevent, or attenuate the severity, extent or duration of the deleterious 
effects of the condition being treated without causing significant adverse side 
effects. The route(s) of administration useful in a particular application are apparent 
to one or ordinary skill in the art. 

Routes of administration of the agents of the invention include, but 
are not limited to, parenteral, and direct injection into an affected site. Parenteral 
routes of administration include but are not limited to intravenous, intramuscular, 
intraperitoneal and subcutaneous. The route of administration of the agents of the 
invention is typically parenteral and is preferably into the bone marrow, into the 
CSF intramuscular, subcutaneous, intradermal, intraocular, intracranial, intranasal, 
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and the like. See, e.g., WO 99/04026 for examples of formulations and routes of 
administration. 

The present invention includes compositions of the agents described 
above, suitable for parenteral administration including, but not limited to, 
pharmaceutical^ acceptable sterile isotonic solutions. Such solutions include, but 
are not limited to, saline and phosphate buffered saline for nasal, intravenous, 
intramuscular, intraperitoneal, subcutaneous or direct injection into a joint or other 
area. 

In providing the agents of the present invention to a recipient 
mammal, preferably a human, the dosage administered will vary depending upon 
such factors as the mammal's age, weight, height, sex, general medical condition, 
previous medical history and the like. 

The administration of the pharmaceutical compositions of the 
invention may be for either "prophylactic" or "therapeutic" purpose. When 
provided prophylactically, the compositions are provided in advance of any 
symptom. The prophylactic administration of the composition serves to prevent or 
ameliorate any subsequent deleterious effects (including severity, duration, or extent 
of symptoms) of the condition being treated. When provided therapeutically, the 
composition is provided at (or shortly after) the onset of a symptom of the condition 
being treated. 

For all therapeutic, prophylactic and diagnostic uses, one or more of 
the agents of the invention, as well as antibodies and other necessary reagents and 
appropriate devices and accessories, may be provided in kit form so as to be readily 
available and easily used. 

Where immunoassays are involved, such kits may contain a solid 
support, such as a membrane (e.g., nitrocellulose), a bead, sphere, test tube, rod, and 
so forth, to which a receptor such as an antibody specific for the target molecule will 
bind. Such kits can also include a second receptor, such as a labeled antibody. Such 
kits can be used for sandwich assays to detect toxins. Kits for competitive assays 
are also envisioned. 
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VI. INDUSTRIAL APPLICABILITY 

Mutated genes of this invention can be expressed in the native host 
cell or organism or in a different cell or organism. The mutated genes can be 
introduced into a vector such as a plasmid, cosmid, phage, virus or mini- 
chromosome and inserted into a host cell or organism by methods well known in the 
art. In general, the mutated genes or constructs containing these mutated genes can 
be utilized in any cell, either eukaryotic or prokaryotic, including mammalian cells 
(e.g., human (e.g., HeLa), monkey (e.g., Cos), rabbit (e.g., rabbit reticulocytes), rat, 
hamster (e.g., CHO and baby hamster kidney cells) or mouse cells (e.g., L cells), 
plant cells, yeast cells, insect cells or bacterial cells (e.g., E. coli). The vectors 
which can be utilized to clone and/or express these mutated genes are the vectors 
which are capable of replicating and/or expressing the mutated genes in the host cell 
in which the mutated genes are desired to be replicated and/or expressed. See, e.g., 
F. Ausubel et al., Current Protocols in Molecular Biology. Greene Publishing 
Associates and Wiley- Interscience (1992) and Sambrook et al. (1989) for examples 
of appropriate vectors for various types of host cells. The native promoters for such 
genes can be replaced with strong promoters compatible with the host into which 
the gene is inserted. These promoters may be inducible. The host cells containing 
these mutated genes can be used to express large amounts of the protein useful in 
enzyme preparations, pharmaceuticals, diagnostic reagents, vaccines and 
therapeutics. 

Mutated genes or constructs containing the mutated genes may also 
be used for in-vivo or in-vitro gene therapy. For example, a mutated gene of the 
invention will produce an mRNA in situ to ultimately increase the amount of protein 
expressed. Such gene include viral genes and/or cellular genes. Such a mutated 
gene is expected to be useful, for example, in the development of a vaccine and/or 
genetic therapy. 

The constructs and/or proteins made by using constructs encoding 
the mutated gag, env, and pol genes could be used, for example, in the production of 
diagnostic reagents, vaccines and therapies for ADDS and AIDS related diseases. 
The inhibitory/instability elements in the HIV-1 gag gene may be involved in the 
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establishment of a state of low virus production in the host. HTV-1 and the other 
Antiviruses cause chronic active infections that are not cleared by the immune 
system. It is possible that complete removal of the inhibitory/instability sequence 
elements from the lentiviral genome would result in constitutive expression. This 
could prevent the virus from establishing a latent infection and escaping immune 
system surveillance. The success in increasing expression of the entire gag/pol gene 
by eliminating the inhibitory sequence element suggests that one could produce 
Antiviruses without any negative elements. Such Antiviruses could provide a novel 
approach towards attenuated vaccines. 

For example, vectors expressing high levels of Gag can be used in 
immunotherapy and immunoprophylaxis, after expression in humans. Such vectors 
include retroviral vectors and also include direct injection of DNA into muscle cells 
or other receptive cells, resulting in the efficient expression of gag, using the 
technology described, for example, in Wolff et al., Science 247:1465-1468 (1990), 
Wolff et al., Human Molecular Genetics l(6):363-369 (1992) and Ulmer et al., 
Science 259:1745-1749 (1993). Further, the gag constructs could be used in 
transdominant inhibition of HIV expression after the introduction into humans. For 
this application, for example, appropriate vectors or DNA molecules expressing 
high levels of ip55 m or p37 gag would be modified to generate transdominant gag 
mutants, as described, for example, in Trono et al., Cell 59:113-120 (1989). The 
vectors would be introduced into humans, resulting in the inhibition of HIV 
production due to the combined mechanisms of gag transdominant inhibition and of 
immunostimulation by the produced gag protein. In addition, the gag constructs of 
the invention could be used in the generation of new retroviral vectors based on the 
expression of lentiviral gag proteins. Lentiviruses have unique characteristics that 
may allow the targeting and efficient infection of non-dividing cells. Similar 
applications are expected for vectors expressing high levels of env. 

Identification of similar inhibitory/instability elements in SIV 
indicates that this virus is a convenient model to test these hypotheses. SIV 
similarly modified could be used in place of HTV in an effort to further minimize the 
possibility of rearrangement events that would lead to the generation of infectious 
HIV. 
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The following examples illustrate certain embodiments of the present 
invention, but should not be construed as limiting its scope in any way. Certain 
modifications and variations will be apparent to those skilled in the art from the 
teachings of the foregoing disclosure and the following examples, and these are 
intended to be encompassed by the spirit and scope of the invention. 

EXAMPLE 1 
Rev-Independent HIV-1 Gag/Pol Molecular Clone 

Figure 1 shows the DNA sequence of a Rev-independent HIV-1 
gag/pol molecular clone. This DNA sequence shown encodes the complete Gag and 
Pol of HIV-1 and can be expressed in a Rev-independent manner when operably 
linked to a promoter. The Rev-independent gag sequence was described in U.S. 
Patent Nos. 5,972,596 and 5,965,726 and the Rev-independent pol sequence was 
generated by eliminating the inhibitory/instability sequences using the methods 
described in U.S. Patent Nos. 5,972,596 and 5,965,726. Others have reportedly 
made Rev independent gag sequences by optimizing codon usage for human cells 
(see, e.g., WO 98/34640). 

Figure 2 shows an alignment of the sequence of the wild - type and 
mutated pol region in pCMVgagpolBNkan. Position #1 in the figure is position 
2641 in plasmid pCMVgagpolBNkan. 

The elimination of INS in gag, pol and env regions allows the 
expression of high levels of authentic HIV-1 structural proteins in the absence of the 
Rev regulatory factor of HTV- 1 . 

EXAMPLE 2 

Rev-Independent SIV Gag Molecular Clone 

Figure 3 shows the DNA sequence of a Rev-independent SIV gag 
molecular clone, SIVgagDX. Figure 4 shows the comparison of wild type (WT) 
and mutant (SIVgagDX) sequences. The wild type SIV sequence is from Simian 
(macaque) immunodeficiency virus isolate 239, clone lambda siv 239-1 (GenBank 
accession No. M33262). 
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EXAMPLE 3 
Rev-Independent SIV Env Molecular Clone 

Figure 16 shows a schematic diagram, and figure 17 shows the DNA 
sequence, of the "env-coding" vector CMVkan/R-R-SIVenvCTE, which is an 
example of a vector comprising a mutated lentiviral env gene sequence which is 
capable of being expressed independently of any SIV or HIV regulatory factors. 
"CMV" denotes the cytomegalovirus promoter; "SRV-CTE" denotes the 
constitutive transport element (CTE) of Simian Retrovirus Type 1 ; "all-STOP" 
denotes a sequence providing translational stops in all three reading frames; "BGH 
terminator" denotes the bovine growth hormone polyadenylation signal. Other 
posttranscriptional control elements can be used'instead of the indicated SRV-CTE, 
for example the one described by Pavlakis and Nappi, PCT/US99/1 1082, filed May 
22, 1999, which was published as WO 99/61596 on December 2, 1999 (and which 
is incorporated herein by reference). 

As mentioned previously above, such a vector encoding a lentiviral 
env gene may be used if it is desired that the vector infect CD4 + T cells. Also as 
mentioned previously above, the CTE element (i.e., the SRV-CTE element in the 
case of vector CMVkan/R-R-SIVenvCTE), can be replaced with another post- 
transcriptional control element, such as the Pavlakis-Nappi element, that is able to 
replace CTE and fflV RRE/Rev. See Pavlakis and Nappi, PCT/US99/1 1082, filed 
May 22, 1999, which was published as WO 99/61596 on December 2, 1999 (and 
which is incorporated herein by reference). 

EXAMPLE 4 

Lentivirial Vector System 

Figure 5 is a schematic of some of the components of a preliminary 
version of the Rev-independent lentiviral vector system exemplified herein, 
including a packaging construct and three different transfer vectors which may be 
used. In the lentiviral system exemplified herein, the packaging construct also 
contains the gene for kanamycin resistance. The lentiviral system exemplified 
herein also contains the vector pHCMV-G, which is shown in Figure 5. 
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In the packaging construct shown in Figure 5, "CMV" denotes the 
cytomegalovirus promoter, "Gag" denotes the gag gene, which generates 
components of the virion core, "Pro" denotes "protease" "RT" denotes "reverse 
transcriptase," 'Int" denotes "integrase" and "BGH poly (A)" denotes the bovine 
growth hormone polyadenylation signal. The protease, reverse transcriptase, and 
integrase genes comprise the "pol" gene. In transfer construct 1, "LTR" denotes the 
HIV "long terminal repeat", which contains a HIV promoter; "mSD" denotes 
"mutated splice donor site," which is present in the construct so that splicing of the 
RNA transcript does not occur; ' V denotes the encapsidation signal; "wGA" 
denotes part of the wild-type gag gene which contains sequences believed to be 
necessary for encapsidation; "X" indicates that the ATG codon of the partial gag 
gene sequence is mutated so that translation of this gene does not occur; "CMV" 
denotes the cytomegalovirus promoter and luciferase is used as a reporter gene. 
Luciferase can be replaced with any gene of interest. Another HIV LTR is present 
at the 3 ' end of transfer construct 1 . Replacement of this LTR in constructs such as 
the transfer construct 1 , 2, or 3 with a promoter-enhancer deleted HIV LTR leads to 
inactivation of LTR after integration. Transfer construct 2 is similar to transfer 
construct 1 , the difference being that a mutated part of the gag gene (denoted 
"mGa") is used instead of the wild-type part of the gag gene. Transfer construct 3 
(pm2BCwCNluci) has different mutations at the 5' splice site and has an intact ATG 
codon so that translation of part of the mutated gag gene occurs. Transfer construct 
3 also has a 5' CMV promoter instead of a 5' LTR promoter. This construct is 
expressed independent of the presence of HTV Tat protein. The transfer constructs 
expressed from the LTR promoter are partially dependent on Tat protein. In 293 
cells significant expression can be achieved in the absence of Tat. See, e.g., 
Valentin et al., Proc. Natl Acad. Sci. USA. 95:8886-91 (1988). 

EXAMPLE 5 

Generation of Packaging Construct pCMVgagpol BNkan 

Figure 6 shows a schematic map of the packaging construct pCMV 
gagpolBNKan. The nucleotide numbering is that of the HXB2R sequence 
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(Genbank accession number K03455 and M38432), where +1 is the start of 
transcription. 

The sequence in HTV-1 gag/pol region was mutated in order to 
eliminate all the INS. The fragment from the beginning of gag to BsrGI site in pol, 
and the fragment KE [Kpnl(3700)- EcoRI(4194)] were previously mutated 
described in Schneider et al, J Virol. 71 : 4892-4903 (1997) and in U.S. Patent Nos. 
5,972,596 and 5,965,726. 

To generate pCMVgagpolBNkan, three fragments within HIV-1 pol 
region were mutated. They are fragment BP [BsrGI(2207)-PflMI(3032)], fragment 
PK [PflMI(3032)-KpnI(3700)] and fragment EN [EcoRI(4194)-NdeI(4668)]. 
Mutagenesis was performed using a modified version of the method described by 
Ho et al., Gene 77: 51-59 (1989) and DNA shuffling (Zhao and Arnold, Nucl. Acid 
Res. 25(6), 1307-1308 (1997). Sixteen oligonucleotides extending over the 
complete sequence of the three fragments were designed. Six oligos corresponded 
to fragment BP, six to fragment PK, and four to fragment EN (the oligonucleotides 
ranged from 130 to 195 bases in length; adjacent oligos overlapped by twenty 
nucleotides). Each fragment was assembled in two steps: 

1) PCR; the reaction was carried out in standard pfu buffer with 
1 0 pmol of each purified big oligo, 0.2 mM of each dNTPs and 2.5 u pju DNA 
polymerase enzyme (Stratagene) in a 50 \A final volume. The PCR program was: 3 
min 96°C followed by 50 cycles of 1 min 94°C, 1 min 55°C, and 1 min + 5 s/cycle 
72°C, ended by 7 min at 72°C. After PCR, the big oligonucleotides were removed 
from the assembled mutated fragment. 

2) The second step was to specifically amplify the assembled 
products with 30 mer primers located at the 5 1 and 3 1 end of each mutated fragment. 
One microliter of the assembled PCR product was used as template in a 25-cycle 
PCR reaction with 50 pmol of each primer, 1 x pfu buffer, 0.2 mM of each dNTP 
and 2.5 u pfu DNA polymerase in a 50 jj! final volume. The PCR program was: 3 
min 96°C, 10 cycles of 30 s 94°C, 30 s 55°C, 45 s 72°C, followed by another 14 
cycles of 30 s 94°C, 30 s 55°C, 45 s + 20 s/cycle 72°C, and finally 7 min 72°C. 
This program gave a single PCR product of the correct size. The amplified BP, PK 
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and EN fragments were individually cloned into PCR-script™ vector using PCR- 
script™ Amp SK(+) Cloning Kit (Stratagene). Clones were randomly selected and 
sequenced. The correct BP, PK and EN fragments together with fragment KE 
previously mutated by Schneider et al. were ligated between BsrGI and Kpnl site of 
p55AMl-R5 (which was previously described in Schneider et al., J. Virol. 71: 4892- 
4903 (1997)) to produce a completely mutated gagpol ORF. The new plasmid 
containing the completely mutated gag/pol was named pLTRgagpolBN. BN stands 
for the modification of the fragment between BsrGI and Ndel. The mutated gag/pol 
was then cloned into a CMVkan vector containing the cytomegalovirus major late 
promoter (GenBank accession no. X17403) and the kanamycin resistance gene, 
resulting in pCMVgagpolBNkan. The plasmid backbone comes from pVR1332 
provided by Vical Inc., and described in Hartikka et al, Hum Gene Ther. 7: 1205-17 
(1996). 

It is understood that different plasmid backbones can be used, e.g., to 
provide good expression in vivo, in the case of DNA injection, for example. 

EXAMPLE 6 

Construction of Transfer Vectors pmBCwCNluci and pmBCmCNluci 

The HTV-1 sequence BC, between BssHII (257) and Clal (376), 
contains the major splice donor site and the encapsidation signal. Six oligos (33 to 
46 bases) were designed to introduce mutations on the splice donor site and the 
AUG start codon of gag. The BC fragment was assembled, amplified and 
sequenced as described in the section concerning the construction of 
pCMVgagpolBN. 

The mutated BC fragment and a fragment of wild type gag between 
Clal (376) and Nsi (793) were placed between the BssHII and Nsi sites of p55RRE 
(Schneider et al., J. Virol. 71:4892-4903 (1997)) to generate pmBCwCN. In 
parallel, the fragment between Clal (376) and Nsil sites of mutated gag from 
p55BMl-10SD+ was used to generate pmBCmCR (p55BMl-10SD+ is similar to 
p55BMl-10, which is described in Schneider et al. (1997), but contains in addition 
the intact splice donor and encapsidation site upstream of gag). The region between 



01/46408 



PCT/US00/34985 



-33- 

Nsil and Xhol containing 3' part of gag and RRE in pmBCwCN and pmBCmCN 
was replaced by a Clal-Xhol fragment containing CMV promoter and luciferase 
gene from pHR'-CMVluci (vector from D. Trono) to generate pmBCwCNluci and 
pmBCmCNluci (which are shown as transfer constructs 1 and 2 in Figure 5, and 
schematically depicted in Figures 7 and 8, respectively). The sequences of these 
plasmids are shown in Figures 10 and 1 1, respectively. Different versions of these 
plasmids have also been created, by standard procedures, with variations in the 
region of the encapsidation site, the first splice donor site, and the initiator gag 
AUG. For example, the transfer construct pm2BcwCNluci (which is shown as 
transfer construct 3 in Fig. 5) has different mutations in the 5" splice site region and 
has an intact ATG. A comparison of the sequences in the BssHII-Cla I region of 
transfer constructs 1 and 2 (mBCwCN frag), transfer construct 3 (m2BCwCN frag), 
HXB2 and NL43 is shown in Fig. 12. 

EXAMPLE 7 
Preparation of Viral Particles 

Lentiviral particles were generated by transient cotransfection of 293 
human kidney cells with a combination of three plasmids: pCMVgagpolBNkan, 
pmBCwCNluci or pmBCmCNluci (transfer vector) and pHCMV-G (Yee et aL, 
Proc. Natl. Acad. Sci., USA, 91:9564-9568 (1994) a plasmid coding for the 
envelope VSV-G (glycoprotein of vesicular stomatitis virus). 

The day before the transfection, 293 cells were plated at a density of 
10 6 cells/plate on a 60 mm plate. Plasmid DNA was transfected by the Ca-phosphate 
precipitation method in the following proportions: 3 \ig packaging construct, 6 \ig 
transfer construct and 100 ng VSV-G encoding construct, pHCMV-G. [Note that 
the LTR promoter can be expressed in 293 cells in the absence of Tat with a 
moderate decrease in efficiency. The transfer constructs can be fully Tat 
independent after replacement of the LTR promoter with a CMV (see, e.g., transfer 
construct 3 in Fig. 5) or other promoter in such a way that the mRNA start site is at 
the beginning of the LTR R region.] In the present experiments for preparation of 
viral particles 500 ng of a Tat expression plasmid was included in the transfection. 



01/46408 



PCTAJS00/34985 



-34- 

Cells were washed the day after transfection and were kept in 
DMEM medium for another 48 hours before the supernatants were harvested. 
Superaatants were spun at 1,200 rpm for 7 mins to eliminate any floating cells. 
pCMVgagpolBNkan produces high levels of Gag protein that is efficiently released 
from the cells (Figure 13), and also produces high levels of functional Pol as judged 
by levels of reverse transcriptase activity similar to those found upon expression of 
complete HTV-1 (Figure 14). 

Supernatants from 293 transfected cells were used to transduce 
several human cell lines (293, Jurkat, U937) and non-dividing human primary 
macrophages. 

EXAMPLE 8 

Cell Transduction 

Transduction was performed by incubating for 3-4 hours at 37°C the 
target cells with 1-2 ml of supernatant containing the retroviral vectors. The amount 
of retroviral vector present in the supernatant was normalized by p24 content 
(measured by ELISA). Equal amounts of p24 gag protein were used for infection of 
cells. This way, differences in production of the different preparations was 
minimized. 

The macrophages used for transduction were isolated from the 
peripheral blood of healthy donors by adherence to plastic. Cells were cultured in 
RPMI + 20% fetal calf serum (FCS) + 10% human serum (HS). After 1 week, non- 
adherent cells were washed off with PBS and the macrophages were kept in culture 
for another 1-2 weeks in the absence of human serum. The cells were washed 2-4 
times with PBS before transduction. 

Cells were harvested 48 hours after transduction (seven days for 
primary macrophages) and the transduction efficiency was determined by measuring 
luciferase activity in cell extracts from the cultures. The results of the transduction 
experiments in 293 Jurkat, U937 and primary macrophages are shown in Figure 
15A-D. These results demonstrate that Rev-independent gag-HTV-1 based 
retroviral vectors display high transduction efficiency in (A) 293 cells, (B) human 
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lymphoid cells, (C) human myeloid cells (U937), as well as (D) non-dividing cells 
such as primary human macrophages. 

EXAMPLE 9 

Use Of Nucleic Acids of the Invention 
In Immunoprophvlaxis Or Immunotherapy 

In postnatal gene therapy, new genetic information has been 
introduced into tissues by indirect means such as removing target cells from the 
body, infecting them with viral vectors carrying the new genetic information, and 
then reimplanting them into the body; or by direct means such as encapsulating 
formulations of DNA ixl liposomes; entrapping DNA in proteoliposomes containing 
viral envelope receptor proteins; calcium phosphate co-precipitating DNA; and 
coupling DNA to a polylysine-glycoprotein carrier complex. In addition, in vivo 
infectivity of cloned viral DNA sequences after direct intrahepatic injection with or 
without formation of calcium phosphate coprecipitates has also been described. 
mRNA sequences containing elements that enhance stability have also been shown 
to be efficiently translated in Xenopus laevis embryos, with the use of cationic lipid 
vesicles. See, e.g., J.A. Wolff, et al., Science 247:1465-1468 (1990) and references 
cited therein. 

Recently, it has also been shown that injection of pure RNA or DNA 
directly into skeletal muscle results in significant expression of genes within the 
muscle cells. J.A. Wolff, etal., Science 247:1465-1468 (1990). Forcing RNA or 
DNA introduced into muscle cells by other means such as by particle-acceleration 
(N. -S. Yang, et al. Proc. Natl. Acad. ScL USA 87:9568-9572 (1990); S.R. Williams 
et al., Proc. Natl. Acad. Sci. USA 88:2726-2730 (1991)) or by viral transduction 
should also allow the DNA or RNA to be stably maintained and expressed. In the 
experiments reported in Wolff et al,, RNA or DNA vectors were used to express 
reporter genes in mouse skeletal muscle cells, specifically cells of the quadriceps 
muscles. Protein expression was readily detected and no special delivery system 
was required for these effects. Polynucleotide expression was also obtained when 
the composition and volume of the injection fluid and the method of injection were 
modified from the described protocol. For example, reporter enzyme activity was 
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reported to have been observed with 10 to 100 |J of hypotonic, isotonic, and 
hypertonic sucrose solutions, Opti-MEM, or sucrose solutions containing 2mM 
CaCl2 and also to have been observed when the 10- to 100- jjI injections were 
performed over 20 min. with a pump instead of within 1 min. 

Enzymatic activity from the protein encoded by the reporter gene 
was also detected in abdominal muscle injected with the RNA or DNA vectors, 
indicating that other muscles can take up and express polynucleotides. Low 
amounts of reporter enzyme were also detected in other tissues (liver, spleen, skin, 
lung, brain and blood) injected with the RNA and DNA vectors. Intramuscularly 
injected plasmid DNA has also been demonstrated to be stably expressed in non- 
human primate muscle. S. Jiao et al., Hum. Gene Therapy 3:21-33 (1992). 

It has been proposed that the direct transfer of genes into human 
muscle in situ may have several potential clinical applications. Muscle is 
potentially a suitable tissue for the heterologous expression of a transgene that 
would modify disease states in which muscle is not primarily involved, in addition 
to those in which it is. For example, muscle tissue could be used for the 
heterologous expression of proteins that can immunize, be secreted in the blood, or 
clear a circulating toxic metabolite. The use of RNA and a tissue that can be 
repetitively accessed might be useful for a reversible type of gene transfer, 
administered much like conventional pharmaceutical treatments. See J .A. Wolff, et 
al., Science 247:1465-1468 (1990) and S. Jiao et al., Hum. Gene Therapy 3:21-33 
(1992). 

It had been proposed by J. A. Wolff et ah, supra , that the intracellular 
expression of genes encoding antigens might provide alternative approaches to 
vaccine development. This hypothesis has been supported by a recent report that 
plasmid DNA encoding influenza A nucleoprotein injected into the quadriceps of 
B ALB/c mice resulted in the generation of influenza A nucleoprotein-specific 
cytotoxic T lymphocytes (CTLs) and protection from a subsequent challenge with a 
heterologous strain of influenza A virus, as measured by decreased viral lung titers, 
inhibition of mass loss, and increased survival. J. B. Ulmer et al., Science 
259:1745-1749(1993). 
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Therefore, it appears that the direct injection of RNA or DNA vectors 
encoding the viral antigen can be used for endogenous expression of the antigen to 
generate the viral antigen for presentation to the immune system without the need 
for self-replicating agents or adjuvants, resulting in the generation of antigen- 
specific CTLs and protection from a subsequent challenge with a homologous or 
heterologous strain of virus. 

CTLs in both mice and humans are capable of recognizing epitopes 

i 

derived from conserved internal viral proteins and are thought to be important in the 
immune response against viruses. By recognition of epitopes from conserved viral 
proteins, CTLs may provide cross-strain protection. CTLs specific for conserved 
viral antigens can respond to different strains of virus, in contrast to antibodies, 
which are generally strain-specific. 

Thus, direct injection of RNA or DNA encoding the viral antigen has 
the advantage of being without some of the limitations of direct peptide delivery or 
viral vectors. See J.A. Ulmer et al, supra , and the discussions and references 
therein). Furthermore, the generation of high-titer antibodies to expressed proteins 
after injection of DNA indicates that this may be a facile and effective means of 
making antibody-based vaccines targeted towards conserved or non-conserved 
antigens, either separately or in combination with CTL vaccines targeted towards 
conserved antigens. These may also be used with traditional peptide vaccines, for 
the generation of combination vaccines. Furthermore, because protein expression is 
maintained after DNA injection, the persistence of B and T cell memory may be 
enhanced, thereby engendering long-lived humoral and cell-mediated immunity. 

1 . Vectors for the immunoprophylaxis or 
immunotherapy against HIV-1 

The mutated gag.pol or gag/pol sequences will be inserted in 
expression vectors using a strong constitutive promoter such as CMV or RSV, or an 
inducible promoter such as HIV-1 . 

The vector will be introduced into animals or humans in a 
pharmaceutically acceptable carrier using one of several techniques such as 
injection of DNA directly into human tissues; electroporation or transfection of the 
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DNA into primary human cells in culture (ex vivo) , selection of cells for desired 
properties and reintroduction of such cells into the body, (said selection can be for 
the successful homologous recombination of the incoming DNA to an appropriate 
preselected genomic region); generation of infectious particles containing the gag 
gene, infection of cells ex vivo and reintroduction of such cells into the body; or 
direct infection by said particles in vivo. 

Substantial levels of protein will be produced leading to an efficient 
stimulation of the immune system. 

In another embodiment of the invention, the described constructs will 
be modified to express mutated Gag proteins that are unable to participate in virus 
particle formation. It is expected that such Gag proteins will stimulate the immune 
system to the same extent as the wild-type Gag protein, but be unable to contribute 
to increased HTV-1 production. This modification should result in safer vectors for 
immunotherapy and immunophrophylaxis. 

EXAMPLE 10 

Inhibition of HIV-1 Expression Using Transdominant 
(TDVTD-Gag-TD Rev or Td Gag-Pro-TD Rev Genes 

Direct injection of DNA or use of vectors other than retroviral 
vectors will allow the constitutive high level of trans-dominant Gag (TDgag) in 
cells. In addition, the approach taken by B.K. Felber et al., Science 239: 1 84-1 87 
(1988) will allow the generation of retroviral vectors, e.g. mouse-derived retroviral 
vectors, encoding HIV-1 TDgag, which will not interfere with the infection of 
human cells by the retroviral vectors. In the approach of Felber, et al., supra, it was 
shown that fragments of the HIV-1 LTR containing the promoter and part of the 
polyA signal can be incorporated without detrimental effects within mouse 
retroviral vectors and remain transcriptionally silent The presence of Tat protein 
stimulated transcription from the HTV-1 LTR and resulted in the high level 
expression of genes linked to the HTV-1 LTR. 

The generation of hybrid TDgag-TDRev or TDgag-pro-TDRev genes 
and the introduction of expression vectors in human cells will allow the efficient 
production of two proteins that will inhibit HTV-1 expression. The incorporation of 
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two TD proteins in the same vector is expected to amplify the effects of each one on 
viral replication. The use of the HTV-1 promoter in a matter similar to one 
described in B.K. Felber, et al., supra, will allow high level Gag and Rev expression 
in infected cells. In the absence of infection, expression will be substantially lower. 
Alternatively, the use of other strong promoters will allow the constitutive 
expression of such proteins. This approach could be highly beneficial, because of 
the production of a highly immunogenic gag, which is not able to participate in the 
production of infectious virus, but which, in fact, antagonizes such production. This 
can be used as an efficient immuniprophylactic or immunotherapeutic approach 
against AIDS. 

Examples of trans-dominant mutants are described in Trono et al., 

Cell 59:112-120 (1989). 

1 . Generation of constructs encoding transdominant Gag mutant 

proteins 

Gag mutant proteins that can act as trans-dominant mutants, as 
described, for example, in Trono et al., supra, will be generated by modifying vector 
p37Ml-10D or p55Ml-13P0 to produce transdominant Gag proteins at high 
constitutive levels. 

The transdominant Gag protein will stimulate the immune system 
and will inhibit the production of infectious virus, but will not contribute to the 
production of infectious virus. 

The added safety of this approach makes it more acceptable for 
human application. 
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Those skilled in the art will recognize that any gene encoding a 
mRNA containing an inhibitory/instability sequence or sequences can be modified 
in accordance with the exemplified methods of this invention or their functional 
equivalents. 

Modifications of the above described modes for carrying out the 
invention that are obvious to those of skill in the fields of genetic engineering, 
virology, immunology, medicine, and related fields are intended to be within the 
scope of the following claims. 

Every reference cited hereinbefore throughout the application is 
hereby incorporated by reference in its entirety. 
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WHAT IS CLAIMED IS : 

1 . A nucleic acid construct comprising a HTV-1 gag/pol gene 
having the coding sequence of the gag/pol gene set forth in Figure 1. 

2. A nucleic acid construct comprising a HTV-1 pol gene having 
the coding sequence of the pol gene set forth in Figure 2. 

3. A nucleic acid construct comprising a SIV-1 gag gene having 
the coding sequence of the gag gene set forth in Figure 3. 

4. A nucleic acid construct comprising an HIV or SIV 5* LTR, 
a packaging signal, a gag/pol gene comprising the sequence set forth in Figure 1 , a 
5' splice site, a 3' splice site, an env gene, a tat gene, a functional RNA transport 
element and a 3* HIV or SIV LTR, said nucleic acid construct being able to produce 
functional Gag, Pol and Env virion components. 

5. A vector comprising the nucleic acid construct of Claim 1, 2, 

3 or 4. 

6. A transformed host cell comprising the nucleic acid construct 
of Claim 1,2, 3 or 4. 

7. A transformed host cell of Claim 6 wherein said cell is a 



eukaryote. 



8. The host cell of Claim 7 wherein said cell is a human cell. 

9. A transformed host cell of Claim 6 wherein said cell is a 



prokaryote. 



10. The host cell of Claim 9 wherein said cell is E. coh. 

11. A pharmaceutical composition comprising the nucleic acid 
construct of Claim 1, 2, 3 or 4 and a pharmaceutical^ acceptable carrier. 

12. A method for inducing antibodies in a mammal comprising 
administering to a mammal a composition of claim 1 1, wherein said nucleic acid 
construct is present in an amount which is effective to induce said antibodies in said 
mammal, 

13. A method for inducing cytotoxic T lymphocytes in a mammal 
comprising administering to a mammal a composition of claim 1 1, wherein said 
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nucleic acid construct is present in an amount which is effective to induce cytotoxic 
T lymphocytes in said mammal. 

14. A vaccine composition for inducing immunity in a mammal 
against HIV infection comprising a pharmaceutical^ acceptable carrier and further 
comprising a therapeutically effective amount of a nucleic acid construct of Claim 1 
capable of producing HIV Gag and Pol proteins in the absence of HIV Rev 
regulatory protein in a cell in vivo . 

15. A vaccine composition for inducing immunity in a mammal 
against HTV infection comprising a pharmaceutical^ acceptable carrier and further 
comprising a therapeutically effective amount of a nucleic acid construct of Claim 2 
capable of producing HIV Pol protein in the absence of HIV Rev regulatory protein 
in a cell in vivo . 

1 6. A vaccine composition according to claim 14 wherein said 
mammal is a human. 

17. A vaccine composition according to claim 15 wherein said 
mammal is a human. 

18. A method for inducing immunity against HIV infection in a 
mammal which comprises administering to a mammal a therapeutically effective 
amount of a vaccine composition according to claim 14. 

19. A method for inducing immunity against HTV infection in a 
mammal which comprises administering to a mammal a therapeutically effective 
amount of a vaccine composition according to claim 15. 

20. A method according to claim 1 8 wherein said mammal is a 



human. 



human. 



21 . A method according to claim 19 wherein said mammal is a 



22. A lentiviral expression system comprising the following: 

(a) a packaging vector comprising a HTV-1 gag/pol gene having 
the nucleotide sequence set forth in Figure 1 ; 

(b) a transfer vector; and 

(c) an envelope encoding vector. 
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23 . A transformed host cell comprising the lentiviral expression 
system of Claim 22. 

24. A transformed host cell of Claim 23 wherein said cell is a 

eukaryote. 

25. The host cell of Claim 24 wherein said cell is a human cell. 

26. A process for making a lentiviral particle comprising 
expressing HIV Gag and HIV Pol in a host cell from a vector comprising the 
nucleotide sequences encoding HIV Gag and HIV Pol set forth in Figure 1 in the 
presence of a gene encoding an envelope protein. 

27. A lentiviral expression system which is capable of 
functioning in the absence of Rev, Tat, and any viral RNA transport element 
comprising the following: 

(a) a packaging vector comprising a HIV-1 gag/pol gene which 
has been mutated to eliminate inhibitory/instability regions; 

(b) a transfer vector; and 

(c) an envelope encoding vector. 

28. A transformed host cell comprising the lentiviral expression 
system of Claim 27. 

29. A transformed host cell of Claim 28 wherein said cell is a 

eukaryote. 

30. The host cell of Claim 29 wherein said cell is a human cell. 

31. A process for making a lentiviral particle in the absence of 
Rev, Tat, or any viral RNA transport element comprising expressing HIV Gag and 
HIV Pol in a host cell from a HIV-1 gag/pol gene which has been mutated to 
eliminate inhibitory/instability regions and expressing an Envelope protein from a 
envelope encoding gene whose expression is independent Rev, Tat, or any viral 
RNA transport element. 

32. A nucleic acid construct comprising a SIV-1 env gene having 
the coding sequence of the env gene set forth in Figure 16. 

33. A vector comprising the nucleic acid construct of claim 32. 

34. A transformed host cell comprising the nucleic acid construct 

of claim 32. 
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35. A pharmaceutical composition comprising the nucleic acid 
construct of claim 32 and a pharmaceutical^ acceptable carrier. 
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FIGURE 1 

ATGGGTGCGAGAGCGTCAGTATTAAGCGGGGGAGAATTAGATCGATGGGAAAAAAT 
TCGGTTAAGGCCAGGGGGAAAGAAGAAGTACAAGCTAAAGCACATCGTATGGGCAA 

GC AGGG AGCT AGAAC GAT T C G CAG T T AAT CC T GG CC TGT T AGAAACAT C AG AAG GC 
TGTAGAC^lAATACTGGGACAGCTACAACCATCCCTTCAGACAGGATCAGAGGAGCT 
TCGATCACTATACAACACAGTAGCAACCCTCTATTGTGTGCACCAGCGGATCGAGA 
TCAAGGACACCAAGGAAGCTTTAGACAAGATAGAGGAAGAGCAAAACAAGTCCAAG 

AAGAAG GC C CAG C AG GCAGCAGCT GAC ACAG GAC AC AGCAAT C AGGT CAGC CAAAA 
TTACCCTATAGTGCAGAACATCCAGGGGCAAATGGTACATCAGGCCATATCACCTA 
GAACTTTAAATGCATGGGTAAAAGTAGTAGAAGAGAAGGCTTTCAGCCCAGAAGTG 
AT ACC C AT G T T T TC AGCAT TAT C AGAAG G AGCCAC C C C ACAGG ACC T GAAC AC GAT 
G T T G AAC AC CGTGGGGG GAC AT C AAG CAG C C AT GC AAAT G T T AAAAG AG AC CAT C A 
AT GAG G AAG C T G CAG AAT G G GAT AG AG T GC AT C CAG T G C ATG C AG GGCCTATTG C A 
CCAGGCCAGATGAGAGAACCAAGGGGAAGTGACATAGCAGGAACTACTAGTACCCT 

T C AG G AACAAAT AGG AT GG AT GAC AAATAAT CCAC CT AT CC C AG TAG G AGAG AT C T 
ACAAGAGGTGGATAATCCTGGGATTGAACAAGATCGTGAGGATGTATAGCCCTACC 
AG CAT T CTG G AC AT AAGAC AAGG ACCAAAGG AAC CC T T TAG AG AC TAT G TAG AC C G 
G T TC T AT AAAACT C T AAG AGC T G AGCAAGC T TCAC AGG AGGTAAAAAAT T GG ATG A 
CAGAAACCTTGTTGGTCCAAAATGCGAACCCAGATTGTAAGACCATCCTGAAGGCT 
CTCGGCCCAGCGGCTACACTAGAAGAAATGATGACAGCATGTCAGGGAGTAGGAGG 
ACCCGGCCATAAGGCAAGAGTTTTGGCCGAGGCGATGAGCCAGGTGACGAACTCGG 
CGACCATAATGATGCAGAGAGGCAACTTCCGGAACCAGCGGAAGATCGTCAAGTGC 
TTCAATTGTGGCAAAGAAGGGCACACCGCCAGGAACTGCCGGGCCCCCCGGAAGAA 
GGGCTGTTG GAAAT GTGGAAAGGAAGG ACAC CAAATGAAAGAT TGT ACTGAG AGAC 
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AGGCTAATTTTTTAGGGAAGATCTGGCCTTCCTACAAGGGAAGGCCAGGGAATTTT 

CT T C AG AGCAG ACCAGAGC C AACAGC CC C ACC AGAAGAGAGC T T CAGGT C TG GGGT 

AG AGAC AACAAC TC CCCC T CAGAAGCAG G AGC CGAT AGACAAGGAAC TGT AT CC T T 

TAACTTCCCTCAGATCACTCTTTGGCAACGACCCCTCGTCACAGTAAGGATCGGGG 

G G C AAC T C AAG G AAGCG C T G C T C GAT AC AG GAG C AG AT G AT ACAG TAT T AG AAG AA 

ATGAGTTTGCCAGGAAGATGGAAACCAAAAATGATAGGGGGGATCGGGGGCTTCAT 

CAAG G T GAG GC AG TAG GACCAGATAC T CATAGAAAT C T GT GGACATAAAG C TAT AG 

GTACAG TAT TAG TAG GACCT ACAC C T G T CAACATAAT T GGAAGAAAT C T GT T GACC 

CAGATCGGCTGCACCTTGAACTTCCCCATCAGCCCTATTGAGACGGTGCCCGTGAA 

GTTGAAGCCGGGGATGGACGGCCCCAAGGTCAAGCAATGGCCATTGACGAAAGAGA 

AGATCAAGGCCTTAGTCGAAATCTGTACAGAGATGGAGAAGGAAGGGAAGATCAGC 

AAGATCGGGCCTGAGAACCCCTACAACACTCCAGTCTTCGCAATCAAGAAGAAGGA 

CAG TACCAAG TGGAGAAAGC TGG TGGAC T TCAGAGAGC TGAACAAGAGAAC TCAGG 

ACTTCTGGGAAGTTCAGCTGGGCATCCCACATCCCGCTGGGTTGAAGAAGAAGAAG 

TCAGTGACAGTGCTGGATGTGGGTGATGCCTACTTCTCCGTTCCCTTGGACGAGGA 

C T T CAG GAAG T AC AC T GC C T T C ACG AT ACC T AG CAT C AAC AAC G AGACACC AG G C A 

TCCGCTACCAGTACAACGTGCTGCCACAGGGATGGAAGGGATCACCAGCCATCTTT 

CAAAGCAGCATGACCAAGATCCTGGAGCCCTTCCGCAAGCAAAACCCAGACATCGT 

GATCTATCAGTACATGGACGACCTCTACGTAGGAAGTGACCTGGAGATCGGGCAGC 

ACAG G AC CAAG AT C GAGGAG CT GAGACAGCAT C T GT T G AGG TGGGGACT GACC AC A 

CCAGACAAGAAGCACCAGAAGGAACCTCCCTTCCTGTGGATGGGCTACGAACTGCA 

TCCTGACAAGTGGACAGTGCAGCCCATCGTGCTGCCTGAGAAGGACAGCTGGACTG 

TGAACGACATACAGAAGCTCGTGGGCAAGTTGAACTGGGCAAGCCAGATCTACCCA 

GGCATCAAAGTTAGGCAGCTGTGCAAGCTGCTTCGAGGAACCAAGGCACTGACAGA 
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AGTGATCCCACTGACAGAGGAAGCAGAGCTAGAACTGGCAGAGAACCGAGAGATCC 

TGAAGGAGCCAGTACATGGAGTGTACTACGACCCAAGCAAGGACCTGATCGCAGAG 

ATCCAGAAGCAGGGGCAAGGCCAATGGACCTACCAAATCTACCAGGAGCCCTTCAA 

GAACCTGAAGACAGGCAAGTACGCAAGGATGAGGGGTGCCCACACCAACGATGTGA 

AGCAGCTGACAGAGGCAGTGCAGAAGATCACCACAGAGAGCATCGTGATCTGGGGC 

AAGACTCCCAAGTTCAAGCTGCCCATACAGAAGGAGACATGGGAGACATGGTGGAC 

CGAGTACTGGCAAGCCACCTGGATCCCTGAGTGGGAGTTCGTGAACACCCCTCCCT 

TGGTGAAACTGTGGTATCAGCTGGAGAAGGAACCCATCGTGGGAGCAGAGACCTTC 

TACGTGGATGGGGCAGCCAACAGGGAGACCAAGCTGGGCAAGGCAGGCTACGTGAC 

C AACCGAGGACG AC AGAAAG TGG TGACCC TGAC TGACACCACCAACCAGAAGAC TG 

AGCTGCAAGCCATCTACCTAGCTCTGCAAGACAGCGGACTGGAAGTGAACATCGTG 

AC AG AC T C AC AG T AC G C AC T G G G CAT CAT C C AAG C AC AAC C AG AC C AAT C C GAG T C 

AG AG C T G G T G AAC C AG AT CAT C GAG C AGC T GAT C AAG AAG G AG AAAG T G T AC C T G G 

CAT G G G T ACCAGC ACACAAAG GAAT T G GAG GAAAT G AACAAG T AGAT AAAT T AG TC 

AGTGCTGGGATCCGGAAGGTGCTGTTCCTGGACGGGATCGATAAGGCCCAAGATGA 

ACATGAGAAGTACCACTCCAACTGGCGCGCTATGGCCAGCGACTTCAACCTGCCAC 

C TG TAG TAGCAAAAGAAATAG TAGCCAGCTGTGATAAATG TCAGCTAAAAGGAGAA 

GC C AT GC AT G GACAAG T AGAC T GT AGT C C AGGAAT AT GGCAG CT GG AC TGC ACG CA 

CCTGGAGGGGAAGGTGATCCTGGTAGCAGTTCATGTAGCCAGTGGATATATAGAAG 

C AGAAG T TAT CCCT GC TGAAAC TG G GCAGGAAACAGC AT AT T T T CT T T TAAAAT T A 

GCAGGAAGATGGCCAGTAAAAACAATACACACGGACAACGGAAGCAACTTCACTGG 

TGCTACGGTTAAGGCCGCCTGTTGGTGGGCGGGAATCAAGCAGGAATTTGGAATTC 

CCTACAATCCCCAATCGCAAGGAGTCGTGGAGAGCATGAACAAGGAGCTGAAGAAG 

ATCATCGGACAAGTGAGGGATCAGGCTGAGCACCTGAAGACAGCAGTGCAGATGGC 
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AGTGTTCATCCACAACTTCAAAAGAAAAGGGGGGATTGGGGGGTACAGTGCAGGGG 
AAAGGATCGTGGACATCATCGCCACCGACATCCAAACCAAGGAGCTGCAGAAGCAG 
ATCACCAAGATCCAGAACTTCCGGGTGTACTACCGCGACAGCCGCAACCCACTGTG 
GAAGGGACCAGCAAAGCTCCTCTGGAAGGGAGAGGGGGCAGTGGTGATCCAGGACA 
ACAGTGACATCAAAGTGGTGCCAAGGCGCAAGGCCAAGATCATCCGCGACTATGGA 
AAAC AG ATG G C AG G T GAT GAT T GT G TGG C AAGTAGAC AGGATGAGGAT T AGAAC C T 
GGAAGAGCCTGGTGAAGCACCATATG (SEQUENCE ID N0:1) 
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FIGURE 2 



>wildtype 
>mutated 

#i 



TGTACAGAGA TGGAAAAGGA AGGGAAAATT TCAAAAATTG 
TGTACAGAGA TGGAGAAGGA AGGGAAGATC AGCAAGATCG 

* + + *** * * 



>wildtype 
>mutated 
#41 



>wildtype 
>mutated 
#81 



>wildtype 
>mutated 
#121 



>wildtype 
>mutated 
#161 



>wildtype 
>mutated 
#201 



GGCCTGAAAA TCCATACAAT ACTCCAGTAT TTGCCATAAA 
GGCCTGAGAA CCCCTACAAC ACTCCAGTCT TCGCAATCAA 



+ + * 



+ * + + 



GAAAAAAGAC AGTACTAAAT GGAGAAAATT AGTAGATTTC 
GAAGAAGGAC AGTACCAAGT GGAGAAAGCT GGTGGACTTC 



+ * * * + 



AGAGAACTTA ATAAGAGAAC TCAAGACTTC TGGGAAGTTC 
AGAGAGCTGA ACAAGAGAAC TCAGGACTTC TGGGAAGTTC 

+ + * + 

AATTAGGAAT ACCACATCCC GCAGGGTTAA AAAAGAAAAA 
AGCTGGGCAT CCCACATCCC GCTGGGTTGA AGAAGAAGAA 



ATCAGTAACA GTACTGGATG TGGGTGATGC ATATTTTTCA 
GTCAGTGACA GTGCTGGATG TGGGTGATGC CTACTTCTCC 

1 1* A # + + + 



>wildtype 
>mutated 
#241 



>wildtype 
>mutated 
#281 



>wildtype 
>mutated 
#321 



GTTCCCTTAG ATGAAGACTT CAGGAAATAT ACTGCATTTA 
GTTCCCTTGG ACGAGGACTT CAGGAAGTAC ACTGCCTTCA 



+ + * 



+ + 



CCATACCTAG TATAAACAAT GAGACACCAG GGATTAGATA 
CGATACCTAG CATCAACAAC GAGACACCAG GCATCCGCTA 



* + 



+ ** * 



TCAGTACAAT GTGCTTCCAC AGGGATGGAA AGGATCACCA 
CCAGTACAAC GTGCTGCCAC AGGGATGGAA GGGATCACCA 



>wildtype 
>mutated 
#361 



>wildtype 
>mutated 
#401 



GCAATATTCC AAAGTAGCAT GACAAAAATC TTAGAGCCTT 
GCCATCTTTC AAAGCAGCAT GACCAAGATC CTGGAGCCCT 



* +• * 



TTAGAAAACA AAATCCAGAC ATAGTTATCT ATCAATACAT 
TCCGCAAGCA AAACCCAGAC ATCGTGATCT ATCAGTACAT 
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>wildtype GGATGATTTG TATGTAGGAT CTGACTTAGA AATAGGGCAG 

>mutated GGACGACCTC TACGTAGGAA GTGACCTGGA GATCGGGCAG 



#441 



>wildtype 
>mutated 
#481 



CATAGAACAA AAATAGAGGA GCTGAGACAA CATCTGTTGA 
CACAGGACCA AGATCGAGGA GCTGAGACAG CATCTGTTGA 



>wildtype 
>mutated 
#521 



GGTGGGGACT TACCACACCA GACAAAAAAC AT CAGAAAG A 
GGTGGGGACT GACCACACCA GACAAGAAGC ACCAGAAGGA 



>wildtype ACCTCCATTC CTTTGGATGG GTTATGAACT CCATCCTGAT 
>mutated ACCTCCCTTC CTGTGGATGG GCTACGAACT GCATCCTGAC 
#561 



>wildtype AAATGGACAG TACAGCCTAT AGTGCTGCCA GAAAAAGACA 
>mutated AAGTGGACAG TGCAGCCCAT CGTGCTGCCT GAGAAGGACA 
#601 



>wildtype GCTGGACTGT CAATGACATA CAGAAGTTAG TGGGGAAATT 
>mutated GCTGGACTGT GAACGACATA CAGAAGCTCG TGGGCAAGTT 
#641 • 



>wildtype GAATTGGGCA AGTCAGATTT ACCCAGGGAT TAAAGTAAGG 

>mutated GAACTGGGCA AGCCAGATCT ACCCAGGCAT CAAAGTTAGG 

#681 



>wildtype 
>mutated 
#721 



CAATTATGTA AACTCCTTAG AGGAACCAAA GCACTAACAG 
CAGCTGTGCA AGCTGCTTCG AGGAACCAAG GCACTGACAG 



>wildtype 
>mutated 
#•761 



AAGTAATACC ACTAACAGAA GAAGCAGAGC TAGAACTGGC 
AAGTGATCCC ACTGACAGAG GAAGCAGAGC TAGAACTGGC 



>wildtype AGAAAACAGA GAGATTCTAA AAGAACCAGT ACATGGAGTG 
>rautated AGAGAACCGA GAGATCCTGA AGGAGCCAGT ACATGGAGTG 
#801 



>wildtype TATTATGACC CATCAAAAGA CTTAATAGCA GAAATACAGA 
>mutated TACTACGACC CAAGCAAGGA CCTGATCGCA GAGATCCAGA 
#841 
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>wildtype 
>mutated 
#881 



AGCAGGGGCA AGGCCAATGG ACATATCAAA TTTATCAAGA 
AGCAGGGGCA AGGCCAATGG ACCTACCAAA TCTACCAGGA 



>wildtype 
>mutated 
#921 



GCCATTTAAA AATCTGAAAA CAGGAAAATA TGCAAGAATG 
GCCCTTCAAG AACCTGAAGA CAGGCAAGTA CGCAAGGATG 



>wildtype AGGGGTGCCC ACACTAATGA TGTAAAACAA TTAACAGAGG 

>mutated AGGGGTGCCC ACACCAACGA TGTGAAGCAG CTGACAGAGG 

#961 

+ + + + + * * 



>wildtype CAGTGCAAAA AATAACCACA GAAAGCATAG TAATATGGGG 

>mutated CAGTGCAGAA GATCACCACA GAG AG CAT CG TGATCTGGGG 

#1001 

+ * + * + * + 



>wildtype AAAGACTCCT AAATTTAAAC TGCCCATACA AAAGGAAACA 

>mutated CAAGACTCCC AAGTTCAAGC TGCCCATACA GAAGGAGACA 

#1041 

* * * + + + * 



>wildtype TGGGAAACAT GGTGGACAGA GTATTGGCAA GCCACCTGGA 

>mutated TGGGAGACAT GGTGGACCGA GTACTGGCAA GCCACCTGGA 

#1081 ■ 

+ + * 



>wildtype TTCCTGAGTG GGAGTTTGTT AATACCCCTC CTTTAGTGAA 
>mutated TCCCTGAGTG GGAGTTCGTG AACACCCCTC CCTTGGTGAA 
#1121 



>wildtype ATTATGGTAC CAGTTAGAGA AAGAACCCAT AGTAGGAGCA 
>mutated ACTGTGGTAT CAGCTGGAGA AGGAACCCAT CGTGGGAGCA 
#1151 



>wildtype 
>mutated 
#1201 



GAAACCTTCT ATGTAGATGG GGCAGCTAAC AGGGAGACTA 
GAGACCTTCT ACGTGGATGG GGCAGCCAAC AGGGAGACCA 



>wildtype 
>mutated 
#1241 



AATTAGGAAA AGCAGGATAT GTTACTAATA GAGGAAGACA 
AGCTGGGCAA GGCAGGCTAC GTGACCAACC GAGGACGACA 



> wild type AAAAGTTGTC ACCCTAACTG ACACAACAAA TCAGAAGACT 
>mutated GAAAGTGGTG ACCCTGACTG ACACCACCAA CCAGAAGACT 
#1281 
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>wildtype 
>mutated 
#1321 



GAGTTACAAG CAATTTATCT AGCTTTGCAG GATTCGGGAT 
GAGCTGCAAG CCATCTACCT AGCTCTGCAA GACAGCGGAC 



* * * 



+ + ** 



>wildtype 
>mutated 
#1361 



TAGAAGTAAA CATAGTAACA GACTCACAAT ATGCATTAGG 
TGGAAGTGAA CATCGTGACA GACTCACAGT ACGCACTGGG 

+ * +** +*+* 



>wildtype 
>mutated 
#1401 



AATCATTCAA GCACAACCAG ATCAAAGTGA ATCAGAGTTA 
CATCATCCAA GCACAACCAG ACCAATCCGA GTCAGAGCTG 



* + * 



>wildtype 
>mutated 
#1441 



GTCAATCAAA TAATAGAGCA GTTAATAAAA AAGGAAAAGG 
GTGAACCAGA TCATCGAGCA GCTGATCAAG AAGGAGAAAG 



* * + + 



>wildtype 
>mutated 
#1481 



TCTATCTGGC ATGGGTACCA GCACACAAAG GAATTGGAGG 
TGTACCTGGC ATGGGTACCA GCACACAAAG GAATTGGAGG 



>wildtype 
>mutated 
#1521 



AAATGAACAA GTAGATAAAT TAGTCAGTGC TGGAATCAGG 
AAATGAACAA GTAGATAAAT TAGTCAGTGC TGGGATCCGG 



>wildtype 
>mutated 
#1561 



AAAGTACTAT TTTTAGATGG AATAGATAAG GCCCAAGATG 
AAGGTGCTGT TCCT.GGACGG GATCGATAAG GCCCAAGATG 

+*★ + *■ * + + + 



>wildtype 
>mutated 
#1601 



AACATGAGAA AT AT CAC AGT AATTGGAGAG CAATGGCTAG 
AACATGAGAA GTACCACTCC AACTGGCGCG CTATGGCCAG 



+. * * * + 



>wildtype 
>mutated 
#1641 



TGATTTTAAC CTGCCACCTG TAGTAGCAAA AGAAATAGTA 
CGACTTCAAC CTGCCACCTG TAGTAGCAAA AGAAATAGTA 

* * * 



>wildtype 
>mutated 
#1681 



GCCAGCTGTG ATAAATGTCA GCTAAAAGGA GAAGCCATGC 
GCCAGCTGTG ATAAATGTCA GCTAAAAGGA GAAGCCATGC 



>wildtype 
>mutated 
#1721 



ATGGACAAGT AGACTGTAGT CCAGGAATAT GGCAACTAGA 
ATGGACAAGT AGACTGTAGT CCAGGAATAT GGCAGCTGGA 
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>wildtype TTGTACACAT TTAGAAGGAA AAGTTATCCT GGTAGCAGTT 
>mutated CTGCACGCAC CTGGAGGGGA AGGTGATCCT GGTAGCAGTT 
#1-761 



>wildtype CATGTAGCCA GTGGATATAT AGAAGCAGAA GTTATTCCAG 

>mutated CATGTAGCCA GTGGATATAT AGAAGCAGAA GTTATCCCTG 



1*1801 



>wildtype 
>mutated 
#1841 



CAGAAACAGG GCAGGAAACA GCATATTTTC TTTTAAAATT 
CTGAAACTGG GCAGGAAACA GCATATTTTC TTTTAAAATT 



>wildtype 
>mutated 
#1881 



AGCAGGAAGA TGGCCAGTAA AAACAATACA TACAGACAAT 
AGCAGGAAGA TGGCCAGTAA AAACAATACA CACGGACAAC 



>wildtype 
>mutated 
#1921 



GGCAGCAATT TCACCAGTGC TACGGTTAAG GCCGCCTGTT 
GGAAGCAACT TCACTGGTGC TACGGTTAAG GCCGCCTGTT 



>wildtype 
>mutated 
#1961 



GGTGGGCGGG AATCAAGCAG GAATTTGGAA TTCCCTACAA 
GGTGGGCGGG AATCAAGCAG GAATTTGGAA TTCCCTACAA 



>wildtype TCCCCAAAGT CAAGGAGTAG TAGAATCTAT GAATAAAGAA 
>mutated TCCCCAATCG CAAGGAGTCG TG GAG AG CAT GAACAAGGAG 
#2001 



>wildtype TTAAAGAAAA TTATAGGACA GGTAAGAGAT CAGGCTGAAC 

>mutated CTGAAGAAGA TCATCGGACA AGTGAGGGAT CAGGCTGAGC 

#2041 

* + + ■+ * * + * + 



>wildtype ATCTTAAGAC AGCAGTACAA ATGGCAGTAT TCATCCACAA 
>mutated ACCTGAAGAC AGCAGTGCAG ATGGCAGTGT TCATCCACAA 
#2081 



>wildtype TTTTAAAAGA AAAGGGGGGA TTGGGGGGTA CAGTGCAGGG 
>mutated CTTCAAAAGA AAAGGGGGGA TTGGGGGGTA CAGTGCAGGG 
#2121 



>wildtype GAAAGAATAG TAGACATAAT AGCAACAGAC ATACAAACTA 

>mutated GAAAGGATCG TGGACATCAT CGCCACCGAC ATCCAAACCA 

#2161 :* 
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>wildtype 
>mutated 
#2201 



AAGAATTACA AAAACAAATT ACAAAAATTC AAAATTTTCG 
AGGAGCTGCA GAAGCAGATC ACCAAGATCC AGAACTTCCG 



>wildtype 
>mutated 
#2241 



GGTTTATTAC AGGGACAGCA GAAATCCACT TTGGAAAGGA 
GGTGTACTAC CGCGACAGCC GCAACCCACT GTGGAAGGGA 



>wxldtype 
>mutated 
#2281 



CCAGCAAAGC TCCTCTGGAA AGGTGAAGGG GCAGTAGTAA 
CCAGCAAAGC TCCTCTGGAA GGGAGAGGGG GCAGTGGTGA 



>v/ildtype 
>mutated 
#2321 



TACAAGATAA TAGTGACATA AAAGTAGTGC CAAGAAGAAA 
TCCAGGACAA CAGTGACATC AAAGTGGTGC CAAGGCGCAA 



>wildtype AGCAAAGATC ATTAGGGATT ATGGAAAACA GATGGCAGGT 
>mutated GGCCAAGATC ATCCGCGACT ATGGAAAACA GATGGCAGGT 
#2361 



>wildtype GATGATTGTG TGGCAAGTAG ACAGGATGAG GATTAGAACA 
>mutated GATGATTGTG TGGCAAGTAG ACAGGATGAG GATTAGAACC 
82401 



>wildtype 
>mutated 
#2441 



TGGAAAAGTT TAGTAAAACA CCATATG 
TGGAAGAGCC TGGTGAAGCA CCATATG 
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FIGURE 3 



ATGGGCGTGAGAAACTCCGTCTTGTCAGGGAAGAAAGCAGATGAATTAG 

AAAAAATTAGGCTACGACCCAACGGAAAGAAAAAGTACATGTTGAAGC 

ATGTAGTATGGGCAGCAAATGAATTAGATAGATTTGGATTAGCAGAAAG 

CCTGTTGGAGAACAAAGAAGGATGTCAAAAAATACTTTCGGTCTTAGCT 

CCATTAGTGCCAACAGGCTCAGAAAATTTAAAAAGCCTTTATAATACTG 

TCTGCGTCATCTGGTGCATTCACGCAGAAGAGAAAGTGAAACACACTGA 

GGAAGCAAAACAGATAGTGCAGAGACACCTAGTGGTGGAAACAGGAAC 

CACCGAAACCATGCCGAAGACCTCTCGACCAACAGCACCATCTAGCGGC 

AGAGGAGGAAACTACCCAGTACAGCAGATCGGTGGCAACTACGTCCAC 

CTGCCACTGTCCCCGAGAACCCTGAACGCTTGGGTCAAGCTGATCGAGG 

AGAAGAAGTTCGGAGCAGAAGTAGTGCCAGGATTCCAGGCACTGTCAG 

AAGGTTGCACCCCCTACGACATCAACCAGATGCTGAACTGCGTTGGAGA 

CCATCAGGCGGCTATGCAGATCATCCGTGACATCATCAACGAGGAGGCT 

GCAGATTGGGACTTGCAGCACCCACAACCAGCTCCACAACAAGGACAA 

CTTAGGGAGCCGTCAGGATCAGACATCGCAGGAACCACCTCCTCAGTTG 

ACGAACAGATCCAGTGGATGTACCGTCAGCAGAACCCGATCCCAGTAGG 

CAACATCTACCGTCGATGGATCCAGCTGGGTCTGCAGAAATGCGTCCGT 

ATGTACAACCCGACCAACATTCTAGATGTAAAACAAGGGCCAAAAGAG 

CCATTTCAGAGCTATGTAGACAGGTTCTACAAAAGTTTAAGAGCAGAAC 

AGACAGATGCAGCAGTAAAGAATTGGATGACTCAAACACTGCTGATTCA 

AAATGCTAACCCAGATTGCAAGCTAGTGCTGAAGGGGCTGGGTGTGAAT 

CCCACCCTAGAAGAAATGCTGACGGCTTGTCAAGGAGTAGGGGGGCCG 

GGACAGAAGGCTAGATTAATGGCAGAAGCCCTGAAAGAGGCCCTCGCA 

CCAGTGCCAATCCCTTTTGCAGCAGCCCAACAGAGGGGACCAAGAAAGC 

CAATTAAGTGTTGGAATTGTGGGAAAGAGGGACACTCTGCAAGGCAATG 

CAGAGCCCCAAGAAGACAGGGATGCTGGAAATGTGGAAAAATGGACCA 

TGTTATGGCCAAATGCCCAGACAGACAGGCGGGTTTTTTAGGCCTTGGT 

CCATGGGGAAAGAAGCCCCGCAATTTCCCCATGGCTCAAGTGCATCAGG 

GGCTGATGCCAACTGCTCCCCCAGAGGACCCAGCTGTGGATCTGCTAAA 

GAACTACATGCAGTTGGGCAAGCAGCAGAGAGAAAAGCAGAGAGAAAG 

CAGAGAGAAGCCTTACAAGGAGGTGACAGAGGATTTGCTGCACCTCAAT 

TCTCTCTTTGGAGGAGACCAGTAG 
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FIGURE 4 



SIV gag . 

#i 

ATGGGCGT.GAGAAACTCCGTCTTGTCAGGGAAGAAAGCAG 

SIV gag 

#41 

AT G AAT T AGAAAAAAT T AG G C T AC G AC C C AAC G G AAAG AA 

SIV gag 

#81 

AAAGTACATGTTGAAGCATGTAGTATGGGCAGCAAATGAA 

SIV gag 

#121 

T TAG AT AG At T T GGAT T AGC AG AAAG C CT G T T G G AGAAC A 

SIV gag 

#161 

AAGAAGGATGTCAAAAAATACTTTCGGTCTTAGCTCCATT 

SIV gag 

#201 ' 

AG T G C C AAC AG G C T C AG AAAAT T T AAAAAGCC T T TAT AAT 

SIV gag 

#241 

ACTGTCTGCGTCATCTGGTGCATTCACGCAGAAGAGAAAG 

SIV gag 

SIVgagDX . . 

#281 

T GAAAC AC AC T G AGG AAGC AAAAC AGAT AG T GC AG AG AC A 

SIV gag A— A T A—A 

SIVgagDX. . C—C C G--G 

#321 

CCTAGTGGTGGAAACAGGAACMACMGAAACYATGCCRAAR 

SIV gag --AAG-A " 

SIVgagDX. . --CTC-C 

#361 

ACMWSTMGACCAACAGCACCATCTAGCGGCAGAGGAGGAA 
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SIV gag -T A — A — A T 

SIVgagDX. . -C G--G--C C 

#401 

AY TACCCAGTACARCARATMGGTGGTAAC TAYGTCCACC T 

SIV gag T-AAG AT -A — T — C A — AT-- 

SIVgagDX . . C-GTC CC-G--C — T C— GO- 

#441 

GCCAYTRWSCCCGAGAACMYTRAAYGCYTGGGTMAARYTG 

SIV gag --A A A--T 

SIVgagDX. C G G--C 

#481 

ATMGAGGARAAGAART T YGGAGC AGAAGT AG T GCCAGGAT 

SIV gag -T T T— 

SIVgagDX. . -C C C— 

#521 

TYCAGGCACTGTCAGAAGGTTGCACCCCCTAYGACATYAA 

SIV gag T T-A--T-- T--G A 

SIVgagDX. .C C-G--C--C--T G 

#561 

YCAGATGYTRAAYTGYGTKGGAGACCATCARGCGGCTATG 

SIV gag T A-A--T— T--A 

SIVgagDX. . C C-T--C--C--C 

#601 

CAGATYATCMGWGAYATYATMAACGAGGAGGCTGCAGATT 

SIV gag 

SIVgagDX. . 

#641 

GGGACTTGCAGCACCCACAACCAGCTCCACAACAAGGACA 

SIV gag T — T A — T 

SIVgagDX. . C--C C~ C 

#681 

AC T TAGGGAGC CG TCAGGATCAGAYAT YGCAGGAACMAC Y 

SIV gag AGT A — T A A-A--A- 

SIVgagDX. . TCC T--C G • C-T — G- 

#721 

WSYTCAGTWGAYGAACARATCCAGTGGATGTACMGWCARC 
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SIV gag C — A T -A-GA 

SIVgagDX. . G--C C— C-TC 

#761 

AGAACCCSATMCCAGTAGGCAACATYTACMGKMGATGGAT 

SIV gag A GT A — A — T — CA-A T A 

SIVgagDX. . G TC G — G — C TC-T C G 

#801 

CCARCTGGGKYTGCARAARTGYGTYMGWATGTAYAACCCR 



SIV gag ~A 

SIVgagDX. .--C 

#841 

ACMAACATTC TAGATGTAAAACAAGGGCCAAAAGAGCCAT 

SIV gag 

#881 

T T C AGAGC T AT G T AGAC AGG T T C T ACAAAAG T T TAAGAG C 

SIV gag 

#921 

AG AAC AG AC AG AT G C AG C AG T AAAGAAT T GG AT G AC T C AA 

SIV gag 

#961 

ACACTGCTGATTCAAAATGCTAACCCAGATTGCAAGCTAG 

SIV gag . . 

#1001 

T G C TGAAGG G GCT G GG T GT GAAT C C CACC CT AGAAGAAAT 

SIV gag 7 

#1041 

GCTGACGGCTTGTCAAGGAGTAGGGGGGCCGGGACAGAAG 

SIV gag 

#1081 

GCTAGATTAATGGCAGAAGCCCTGAAAGAGGCCCTCGCAC 

SIV gag 

#1121 

CAGTGCCAATCCCTTTTGCAGCAGCCCAACAGAGGGGACC 



SIV gag 
#1161 



AAGAAAGCCAATTAAGTGTTGGAATTGTGGGAAAGAGGGA 
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SIV gag 

#1201 

CAC T C T GCAAGGCAAT GCAGAGC C CCAAGAAGACAGGGAT 

SIV gag 

#1241 

G C T G GAAAT G TG GAAAAAT GGACCAT G T T AT GGCCAAAT G 

SIV gag 

#1281 

CCCAGACAGACAGGCGGGTTTTTTAGGCCTTGGTCCATGG 

SIV gag 

#1321 

GGAAAGAAGCCCCGCAATTTCCCCATGGCTCAAGTGCATC 

SIV gag 

#1361 

AGGGGCTGATGCCAACTGCTCCCCCAGAGGACCCAGCTGT 

SIV gag 

#1401 

G GAT C TG C T AAAG AAC T ACAT G CAG T T G GG CAAGC AGC AG 

SIV gag 

#1441 

AGAGAAAAGCAGAGAGAAAGCAGAGAGAAGCC T TACAAGG 

SIV gag 

#1481 

AGGTGACAGAGGATTTGCTGCACCTCAATTCTCTCTTTGG 



SIV gag 

#1521 

AGGAGACCAGTAG 
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FIGURE 6 



Sail (758) 
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FIGURE 7 



BssHII (711) 




Kpnl (3563) 



PCTAJS00/34985 
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FIGURE 8 




BssHII (711) 
Clal (830) 
Accl (9S9) 



Clal (3259) 

Xhol (3548) 
Apal (3557) 
Kpnl (3563) 
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FIGURE 9 



1 


CCTCGCCATT GCATACGTTC 


TATCCATATC 


ATAATATGTA 


CATTTATAT7 GGCTCATGTC 


CAACATTACC 


71 


GCCATGTPGA 


CATTGATTAT 


TGACTAGTTA 


TTAATAGTAA 


TCAATTACGG 


GGTCATTAGT 


TCATAGCCCA 


141 


TATATGGAGT 


TCCGCGTTAC 


ATAACTTACG 


GTAAATGGCC 


CGCCTGGCTG 


ACCGCCCAAC 


GACCCCCGCC 


211 


CATTGACGTC 


AATAATGACG 


TATGTTCCCA 


TAGTAACGCC 


AATAGGGACT 


TTCCATTGAC 


GTCAATGGGT 


281 


GGAGTATTTA 


CGGTAAACTG 


CCCACTTGGC 


AGTACATCAA 


GTGTATCATA 


TGC CAAGT AC 


GCCCCCTAT7 


351 


GACGTCAATG 


ACGGTAAATG 


GCCCGCCTGG 


CATTATGCCC 


AGTACATGAC 


CTTATGGGAC 


TTTCCTACTT 


421 


GGCAGTACAT 


CTACGTATTA 


GTCATCGCTA 


TTACCATGGT GATGCGGTTT TGGCAGTACA 


TCAATGGGCG 


491 


TGGATAGCGG 


TITGACTCAC 


GGGGATTTCC 


AAGTCTCCAC 


CCCATTGACG 


TCAATGGGAG 


TTTGTTTTGG 


561 


CACCAAAATC 


AACGGGACTT 


TCCAAAATGT 


CGTAACAACT 


CCGCCCCATT 


GACGCAAATG 


GGCGGTAGGC 


631 


GTGTACGGTG 


GGAGGTCTAT 


ATAAGCAGAG 


CTC3TTTAGT 


GAACCGTCAG 


ATCGCCTGGA 


GACGCCATCC 


701 


ACGCTGTTTT 


GACCTCCATA 


aAAGACACCG 


GGACCGATCC 


AGCCTCCGCG 


Sail (75f 

GGCGCGCGTC GACAGAGAGA 


771 


TGGGTGCGAG 


AGCGTCAGTA 


TTAAGCGGGG 


GAGAATTAGA 


TCGATGGGAA 


AAAATTCGGT 


TAAGGCCAGG 


841 




AAGTACAAGC 


TAAAGCACAT 


CGTATGGGCA 


AGCAGGGAGC 


TAGAACGATT 


CGCAGTTAAT 


911 


CCTGGCCTGT 


TAGAAACATC 


AGAAGGCTGT 


AGACAAATAC 


TGGGACAGCT 


ACAACCATCC 


CTTCAGACAG 


?ai 


GA7CAGAGGA 


GCTTCGATCA 


CTATACAACA 


CnGTAGCAAC 


CCTCTATTGT 


GTGCACCAGC 


GGATCGAGAT 


1051 


CAAGGACACC 


AAGGAAGCTT 


TAGACAAGAT 


AGAGGAAGAG 


CAAAACAAGT 


CCAAGAAGAA 


GGCCCAGCAG 


1121 


GCAGCAGCTG 


ACACAGGACA 


CAGCAATCAG 


GTCAGCCAAA 


ATTACCCTAT 


AGTGOiGAAC 


ATCCAGGGGC 


1191 


AAATGGTACA 


TCAGGCCATA 


TCACCTAGAA 


CTTTAAATGC 


ATGGGTAAAA 


GTAGTAGAAG 


AGAAGGCTTT 


1261 


CAGCCCAGAA 


GTGATACCC=i 


TGTTTTCAGC 


ATTATCAGAA 


GGAGCCACCC 


CACAGGACCT 


GAACACGATG 


1331 


TTGAACACCG 


TGGGGGGACA 


TCAAGCAGCC 


ATGCAAATGT 


TAAAAGAGAC 


CATCA^TGAG 


GAAGCTGCAG 


1401 


AATGGGATAG 


AGTGCATCCA 


GTGCATGCAG 


GGCCTATTGC 


ACCAGGCCAG 


ATGAGAGAAC 


CAAGGGGAAG 


1471 


TGACATAGCA 


GGAACTACTA 


GTACCCTTCA 


GGAACAAATA 


GGATGGATGA 


CAAATAATCC 


ACCTATCCCA 


1541 


GTAGGAGAGA 


TCTACAAGAG 


GTGGATAATC 


CTGGGATTGA 


ACAAGATCGT 


GAGCATGTAT 


AGCCCTACCA 


1611 




CATAAGACAA 


GGACCAAAGG 




AGACTATGTA 




ATAAAACTCT 


GCATICIGGA 


AACVL-rrrAG 


GACCGGTTCT 


1681 


AAGAGCTGAG 


CAAGCTTCAC 


AGGAGGTAAA 


AAATTGGATG 


ACAGAAACCT 


TGTTQGTCCA 


AAATGCGAAC 


1751 


CCAGATTGTA 


AGACCATCCT 


GAAGGCTCTC 


GGCCCAGCGG 


CTACACTAGA 


AGAAATGATG 


ACAGCATGTC 


1821 


AGGGAoTAGG 


AGGACCCGGC 


CATAAGGCAA 


GAGTTrTGGC 


CGAGGCGATG 


AGCCAGGTGA 


CGAACTCGGC 


1891 


GACCATAATG 


ATGCAGAGAG 


GCAACTTCCG 


GAACCAGCGG 


AAGATCGTCA 


AGTGCTTCAA TTGTGGCAAA 
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19S1 


GAAGGGCACA CCGCCAGGAA CTGCCGGGCC CCCCGGAAGA AGCGCTGTIT3 GAAATGTGGA AAGGAAGG&C 




ACCAAATGAA AGATTGTACT GAGAGACAGG 


CTAATTTTTT AGGGAAGATC TGGCCTTCCT ACAAGGGAAG 




GCCAGGGAAT 


TTTCTTCAGA GCAGACCAGA GCCAACAGCC CCACCAGAAG AGAGCTTCAG GTCTGGGGTA 


2171 


GAGACAACAA 


CTCCCCCTCA 


GAAGCAGGAG 


CCGATAGACA AGGAACTGTA 


TCCITIAACT 


TCCCTCAGAT 


2241 


CACTCTTTGG 


CAACGACCCC 


TCCTCACAGT 


AAGGATCGGC GGGCAACTCA 


AGGAAGCGCT 


GCTCGATACA 


2311 


GGAGCAGATG 


ATACAGTATT 


AGAAGAAATG 


AGTTTCCCAG GAAGATCGAA 


4 » * * 4 ^ «*v* 


ATAGGGGGGA 


2381 


TCGGGGGCTT 


CATCAAGGTG 


AGGCAGTACG 


ACCAGATACT CATAGAAATC 


TCTGGACATA 


AAGCTATAGG 


2451 


TACAGTATTA 


GTAGGACCTA 


CACCIGTCAA 


CATAATCGGA AGAAATCTGT TGACCCAGAT CCGCTGCACC 


2521 

v W w A 


TTGAACTTCC 


CCATCAGCCC 


TATTGAGACG 


GTGCCCGTGA AGTTGAAGCC GGGGATGGAC 


GGCCCCAAGG 


2591 


TCAAGCAATC 


GCCATTGACG 


AAAGAGAAGA 


TCAAGGCCTT AGTCGAAATC TGTACAGAGA TGGAGAAGGA 


26S1 


AGGGAAGATC 


AGCAAGATCG 


GGCCTGAGAA 


CCCCTACAAC ACTCCAGTCT 


TCGCAA1CAA 


GAAGAAGGAC 


2731 


AGTACCAAGT 


GGAGAAAGCT 


GGTGGACTTC 


AGAGAGCTGA ACAAGAGAAC 


TCAGGACTTC 


TGGGAAGTTC 




AGCTGGGCAT 


CCCACATCCC 


GCTGGGTTGA 


AGAAGAAGAA GTCAGTGACA 


GTGCTGGATG 


TGGGTGATGC 


iO f 1 


C7ACTTCTCC 


GTTCCCTTGG 


ACGAGGACTT 


CAGGAAGTAC ACTGCCTTCA 


CGATACCTAG 


CATCAACAAC 


TO/!! 


GAGACACCAG 


GCATCCGCTA 


CCAGTACAAC 


GTGCIGCCAC AGGGATGGAA GGGATCACCA 


GCCATCTTTC 


Iftl 1 


AAAGCAGCAT 


GACCAAGATC 


CTGGAGCCCT 


TCCG2AAGCA AAACCCAGAC 


ATCGTGATCT 


ATCAGTACAT 


30H1 


GGACGACCTC 


TACGTAGGAA 


GTGACCTGGA 


GATCGGGCAG CACAGGACCA 


AGATCGAGGA 


GCTGAGACAG 


3151 


CATCTGTTGA 


GGT3GGGACT 


GACCACACCA 


GACAAGAAGC ACCAGAAGGA 


Accrcccrrc: 


CTGTGGATGG 


3221 


GCTACGAACT 


GCATCCTGAC 


AAGTGGACAG 


TGCAGCCCAT CGTGCTGCCT 


GAGAAGGACA 


GCTGGACTGT 


3291 


GAACGACATA 


CAGAAGCTCG 


TGGGCAAGTT 


GAACTGGGCA AGCCAGATCT 


ACCCAGGCAT 


CAAAGTTAGG 


3351 


CAGCTGTGCA 


AGCTGC7TCG 


AGGAACCAAG 


GCACTGACAG AAGTGATCCC 


ACTGACAGAG 


GAAGCAGAGC 


3431 


TAGAACTGGC 


AGAGAACCGA 


GAGA7CCTGA 


AGGAGCCAGT ACATGGAGTG 


TACTACGACC 


CAAGCAAGGA 


3501 


CCTGATCGCA 


GAGATCCAGA 


AGCAGGGGCA 


AGGCCAA7GG ACCTACCAAA 


TCTACCAGGA 


GCCCTPCAAG 


3571 


AACCTGAAGA 


CAGGCAAGTA 


CGCAAGGATG 


AGGGGTGCCC ACACCAACGA 


TGTGAAGCAG 


CTGACAGAGG 


3641 


CAGTGCAGAA 


GATCACCACA 


GAGAGCATCG 


TGATCTGGGG CAAGACTCCC 


AAGTTCAAGC 


TGCCCATACA 


3711 


GAAGGAGACA 


TGGGAGACAT 


GGTGGACCGA 


GTACTGGCAA GCCACCTGGA 


TCCCTGAGTG 


GGAGTTCGTG 


3*781 


AACACCCCTC 


CCTTGGTGAA 


ACTGTGGTAT 


CAGCTGGAGA AGGAACCCAT 


CGTGGGAGCA 


GAGACCTTCT 


3851 


ACG7CGATGG 


GGCAGCCAAC 


AGGGAGACGA 


AGCTGGGCAA GGCAGGCTAC 


GTGACCAACC 


GAGGACGACA 



3921 GAAAGTGGTG ACCCTGACTG ACACCACCAA CCAGAAGACT GAGCTGCAAG CCATCTACCT AGCTCTGCAA 



3991 GACAGCGGAC TGGAAGTGAA CATCGTGACA GACTCACAGT ACGCACTGGG CATCATCCAA GCACAACCAG 
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4061 


ACCAATCCGA 


GTCAGAGCTG 


GTGAACCAGA 


TCATCGAGCA 


GCTGATCAAG 


AAGGAGAAAG 


TGTACCTGGC 


4131 


ATGGGTACCA GCACACAAAG GAATTGGAGG AAATGAACAA GTAGATAAAT TAGTCAGTGC TGGGATCCGG 


4201 


AAGGTGCTGT 


TCCTGGACGG 


GATCGATAAG 


GCCCAAGATG 


AACATGAGAA 


GTACCACTCC 


AACTGGCGCG 


4271 


CTATGGCCAG 


CGACTTCAAC 


CTGCCACCTG 


TAGTAGCAAA 


AGAAATAGTA 


GCCAGCTGTG 


(ins * h^r**1V*1t 

ATAAATvTIw* 


4341 


GCTAAAAGGA 


GAAGCCATGC 


ATGGACAAGT 


AGACTGTAGT CCAGGAATAT 


GGCAGCTGGA 




4411 


CTGGAGGGGA 


AGGTGATCCT 


GGTAGCAGTT 


CATGTAGCCA 


GTGGATATAT 


AGAAGCAGAA 


GTTATCCCTG 


4481 


CTGAAACTGG 


GCAGGAAACA GCATATTTIC TTTTAAAATT AGCAGGAAGA TGGCCAGTAA 


AAACAATACA 


4551 


CACGGACAAC 


GGAAGCAACT 


TCACTGGTGC 


TACGGTTAAG 


GCCGCCTGTT 


GGTGGGCGGG 


AATCAAGCAG 


4621 


GAATITGGAA 


TTCCCTACAA 


TCCCCAATCG 


CAAGGAGTCG 


TGGAGAGCAT 


GAACAAGGAG 


CTGAAGAAGA 


4691 


TCATCGGACA 


AGTGAGGGAT 


CAGGCTGAGC 


ACCTGAAGAC 


AGCAGTGCAG 


ATGGCAGTGT 


TCATCCACAA 


4761 


CTTCAAAAGA 


AAAGGGGGGA 


TTGGGGGGTA 


CAGTGCAGGG 


GAAAGGATCG 


TGGACATCAT 


CGCCACCGAC 


4831 


ATCCAAACCA 


AGGAGCTGCA 


GAAGCAGATC 


ACCAAGATCC 


AGAACTTCCG 


GGTGTACTAC 


CGCGACAGCC 


4901 


GCAACCCACT 


GTGGAAGGGA 


CCAGCAAAGC 


TCCTCTGGAA 


GGGAGAGGGG 


GCAGTGGTGA 


TCCAGGACAA 


4971 


CAGTGACATC 


AAAGTGGTGC 


CAAGGCGCAA 


GGCCAAGATC 


ATCCGCGACT 


ATGGAAAACA 


GATGGCAGGT 


5041 


GATGATTGTG 


TGGCAAGTAG 


ACAGGATGAG 


GATTAGAACC 


TGGAAGAGCC 


TGGTGAAGCA 


CCATATGGCG 




















Nhel (5117) 










5111 


BstBI 

TTCGAAGCTA 


(5111) 

GCCTCGAGAT 


CCAGATCTGC 




AGTTGCCAGC 


CATCTGTTG7 


TTGCCCCTCC 


5181 


CCCGTGCCTT 


CCTTGACCCT 


GGAAGGTGCC 


ACTCCCACTG 


TGCTTTCCTA 


ATAAAATCAG 


GAAATTGCAT 


5251 


CGCA1TGTCT 


GAGTAGGTG7 


CAT7CTATTC 


T*o 3 GOG GT^jsj 


GGTGCrGGCAG 


CACAGCAAGG 


GGGAGGATTG 


5321 




AGCAGGCATG 


CTGGGGATC*? 


GGTGGGCTCT 




AGGTG CTG AA 


GAATTGACCC 


5391 


G3T7CCTCCT 


GGGCvJAGrw* 


GAAGCAGGCA 


CA7CCCC7TC 


TCT1TGACAC 


ACCCTGTCCA 


CGCCCCTGGT 


5461 


TCT7AGTTCC 


AGCCCCACTC 


ATAGGAGAC -* 


CATAGCPGAG 


GAGCGCTCCG 


CCTTCAATCC 


CACCCGCTAA 


5531 


AGTACTTGGA 


GCGG7CTCTC 


CCTCCCTCAT 


CAC Z CC ACCA 


AACVAAACCT 


AGCCTCCAAG 


AGTGGGAAGA 


5601 


AATTAAAGCA 


AGATAGGC T.-i 


TTA-AGTGCAG 


AGG GAGAGAA 


AATCCCTCCA 


ACATGTGAGG 


AAGTAATGAG 



■ 5671 AGAAATCA7A GAATTICTTC CGCTTCCTCG CTCACTGACT CGCTGCGCTC GGTCGTTCGG CTGCGGCGAG 



5741 CGGTATCAGC TCACTCAAAG GCGGTAATAC GGTTATCCAC AGAATCAGGG GATAACGCAG GAAAGAACAT 

5811 GTGAGCAAAA GGCCAGCAAA AGGCCAGGAA CCGTAAAAAG GCCGCGTTGC TGGCGTTTTT CCATAGGCTC 

5881 CGCCCCCCTG ACGAGCATCA CAAAAATCGA CGCTCAAGTC AGAGGTGGCG AAACCCGACA GGACTATAAA 

5951 GATACCAGGC GTTTCCCCCT GGAAGCTCCC TCGTGCGCTC TCCTGTTCCG ACCCTGCCGC TTACCGGATA 

6021 CCTGTCCGCC TTTCTOXTT CGGGAAGCGT GGCGCTTTCT CAATGCTCAC GCTGTAGGTA TCTCACTTCG 

6091 GTGTAGGTCG TTCGCTCCAA GCTGGGCTGT GTGCACGAAC CCCCCGTTCA GCCCGACCGC TGCGCCTTAT 

6161 CCGGTAACTA TCGTCTTGAG TCCAACCCGG TAAGACACGA CTTATCGCCA CTGGCAGCAG CCACTGGTAA 

6231 CAGGATTAGC AGAGCGAGGT ATGTAGGCGG TGCTACAGAG TTCTTGAAGT GGTGGCCTAA CTACGGCTAC 
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6301 ACTAGAAGGA CAGTATTIGG TATCTGCGCT CTGCTGAAGC CAGTTACCTT CGGAAAAAGA GTTGGTAGCT 
6371 CTTGATCCGG CAAACAAACC ACCGCTGGTA GCGGTCGTIT TnTGTTTGC AAGCAGCAGA TTACGCGCAG 
6441 AAAAAAAGGA TCTCAAGAAG ATCCTTTGAT CTTTTCTACG GGGTCTGACG CTCAGTGGAA CGAAAACTCA 
6511 CGTTAAGGGA TTTTGGTCAT GAGATTATCA AAAAGGATCT TCACCTAGAT CCTTTTAAAT TAAAAATGAA 
6581 GTTTTAAATC AATCTAAAGT ATATATGAGT AAACTTGGTC TGACAGTTAC CAATGCTTAA TCAGTGAGGC 
6651 ACCTATCTCA GCGATCTGTC TATTTCGTTC ATCCATAGTT GCCTGACTCC GGGGGGGGGG GGCGCTGAGG 
6721 TCVGCCTCGT GAAGAAGGTS TTGCTGACTC ATACCAGGCC TGAATCGCCC CATCATCCAG CCAGAAAGTG 
6791 AGGGAGCCAC GGTTGATGAG AGCnTGTTG TAGGTGGACC AGTCGGTGAT TTTGAACTTT TGCTTTGCCA 
6861 CGGAACGGTC TGCGTTGTCG GGAAGATGCG TGATCTGATC CTTCAACTCA GCAAAAGTTC GATTTATTCA 
6931 ACAAAGCCGC CGTCCCGTCA AGTCAGCGTA ATGCTCTGCC AGTGTTACAA CCAATTAACC AATTCTGATT 
7001 AGAAAAACTC ATCGAGCATC AAATGAAACT GCAATTTATT CATATCAGGA TTATCAATAC CATATTTCTG 
271* PhePheGI u AspleuMetl euHisPhed nLeuLysAsn MetAspProA snAsplleGI yTyrLysGIn 
7071 AAAAAGCCGT TTCTGTAATG AAGGAGAAAA CTCACCGAGG CAGTTCCATA GGATGGCAAG ATCCTGGTAT 

248< PheUeuArgL ysGlnLeuSe rProSerPhe GluGlyLeuC ysAsnTrpLo ulleAlaLeu AspGlnTyrA 
7141 CGGTCTGCGA TTCCGACTCG TCCAACATCA ATACAACCTA TTAATTTCCC CTCGTCAAAA ATAAGGTTAT 

224 * r gAspAl a 1 1 eGlyValArg GlyValAspl teCysGlyll eLeuLysGly GluAspPhel leLeuAsnAs 
7211 CAAGTGAGAA ATCACCATGA GTGACGACTG AATCCGGTGA GAATGGCAAA AGCTTATGCA TTTCTTTCCA 

201*pLeuSer Phe AspGIyHisT hrValValSe rAspProSer PheProLeuL euLysHisMe tGluLysTrp 
7281 GACTTGTTCA ACAGGCCAGC CATTACGCTC GTCATCAAAA TCACTCGCAT CAACCAAACC GTTATTCATT 

178< Val Gl nGI uV aiProTrpGf yAsnArgGlu AspAspPheA spSerAlaAs pValLeuGIy AsnAsnMatA 
7351 CGTGATTGCG CCTGAGCGAG ACGAAATACG CGATCGCTGT TAAAAGGACA ATTACAAACA GGAATCGAAT 

154 <r gSer GlnAI aGI nAI aLeu ArgPheValA rgAspSerAs nPheProCys AsnCysValP rolleSerHi 
7421 GCAACCGGCG CAGGAACACT GCCAGCGCAT CAACAATATT TTCACCTGAA TCAGGATATT CTTCTAATAC 

131< sLeuArgArg LeuPheValA laLeuAtaAs pVallleAsn GluGlySerA spProTyrGI uGluLeuVal 
7491 CTGGAATSCT GTTTTCCCGG GGATCGCAGT GGTGAGTAAC CATGCATCAT CAGGAGTACG GATAAAATGC 

108«G! nPheAl aT hrLysGlyPr oMeAlaTtir ThrLeuLeuT rpAlaAspAs pProThrArg HePheHisL 
7561 TTGATGGTCG GAAGAGGCAT AAATTCCGTC AGCCAGTITA GTCTGACCAT CTCATCTGTA ACATCATTGG 
S4*ysl I eThr Pr oLeuProMet PheGluThrl euTrpAsnLe uArgValMet GtuAspThrV alAspAsnAI 
7631 CAACGCTACC TTTGCCATGT TTCAGAAACA ACTCT3GCGC ATCGGGCTTC CCATACAATC GATAGATTGT 
61<aVal SerG!y LysGlyHisL ysLeuPheLe uGluProAla AspProLysG lyTyrLeuAr gTyrlleThr 
7701 CGCACCTGAT TGCCCGACAT TATCGCGAGC CCATTTATAC CCATATAAAT CAGCATCCAT GTTGGAA1TT 
38< Al aGI ySer G InGlyValAs nAspArgAI a TrpLysTyrG lyTyrLeuAs pAlaAspMet AsnSerAsnL 
7771 AATCGCGGCC TCGAGCAAGA CGTTTCCCGT TGAATATGGC TCATAACACC CCTTGTATTA CTCTTTATGT 

14<euArgProAr gSerCysSer Thr Gl uArgG t n 1 1 eHi sSe rWat 
7841 AAGCAGACAG TTTTATTGTr CATGATGATA TATTTITATC TTGTGCAATG TAACATCAGA GATTITGAGA 
7911 CACAACGTGG CTCTCCCCCC CCCCCCATTA TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC 
7981 ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCACCTG 
8051 ACGTCTAAGA AACCATTATT ATCATGACAT TAACCTATAA AAATAGGCGT ATCACGAGGC CCTTTCGTCT 
8121 CGCGCGTTTC GGTGATGACG GTGAAAACCT CTGACACATG CAGCTCCCGG AGACGGTCAC AGCITGTCTG 
8191 TAAGCGGATG CCGGGAGCAG ACAAGCCCGT CAGGGCGCGT CAGCGGGTGT TGGCGGGTGT CGGGGCTGGC 
8261 TTAACTATGC GGCATCAGAG CAGATTGTAC TGAGAGTGCA CCATATGCGG TGTGAAATAC CGCACAGATG 
8331 CGTAAGGAGA AAATACCGCA TCAGATTGGC TATTGG 
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1 TCGAAGGCCT AATTTGGTCC CAAAAAAGAC AAGAGATCCT TGATCTGTGG ATCTACCACA CACAAGGCTA 



71 


cttccctgat 


TGGCAGAACT ACACACCAGG 


GCCAGGGATC 


AGATATCCAC 


TGAlXTi'lt^i 


ATGGTGCTTC 


141 


AAGTTAGTAC 


CAGTTGAACC AGAGCAAGTA GAAGAGGCCA 


AATAAGGAGA 


GAAGAACAGC 


TTGTTACACC 


211 


CTATGAGCCA 


GCATGGGATG GACGACCCGG 


AGGGAGAAGT 


ATTAGIUTUG 




GCCTCCTAGC 


281 


ATTTCGTCAC 


ATGGCCCGAG AGCTGCATCC 


GGAGTACTAC 


AAAGACTGCT 


GACATCGAGC 


TTTCTACAAG 


351 


GGACTTTCCG 


CPGGGGA w L'l TCCAGGGAGG 


TGTGGCCTGG 


GCGGGACTGG 


GGAG7GGCGA 


GCCCTCAGAT 


421 


GCTACATATA 


AGCAGCTGCT TTTTGCCTGT 


ACTGGGTCTC 


TCTGG1TAGA 


CCAGATCTGA 


GCCTGGGAGC 


491 


TCTCTGGCTA 






AAGCTTGCCT 


TGAGTGCTCA 




ACTAGGGAAC CCACiUllTA 


AGCCTCAATA 


AAGTAGTG1G 


561 


TGCCCGTCTG 


TTGTGTGACT CTGGTAACTA 




AGACCCmT 


AGTCAGTGTG 




GAGATCCCTC 


GAAAATCTCT 



631 AGCAGTGGCG CCCGAACAGG GACTTGAAAG CGAAAGTAAA GCCAGAGGAG ATCTCTCGAC GCAGGACTCG 



BssHII (711) 



701 


GCTTGCTGAA 


GCGo3^2CS3 


caagaggoga 




tngACcagGa 




zzzgaczagc 














Cial (830 


771 




acgagagagC 








TCGATGvajAA 














» 




841 


AAAA 1 ' - C GGT 


TAA GGC AG^ 




A-k-i - A - .---\AT 


TA-lA-. ZATAT 


AGTA T^GGGCA 


AGCAGGGAGC 


911 


TV-.GnA C^jATT 


CGCAG - ' * AAT 


^ \— — " — 
v. -~ .JTJw <. . o . 


TA j.-AA 3AT*C 


Accl (959) 

A3AAGGCTGT AGACAAATAC TGGGACAGCT 


981 


■r" r - — — w < > wt» 


CTTCAGACAG 


GAT^AGA-.GA 


A CTT A G.-.TC A 


TTATA.TAATA 


CAGTAGCAAC 


CCTCTATTGT 


1051 


GTGCATCAAA 


GGATAGAGAT 


AAAAjACACC 


AAGGAAGOTT 


TAGACAAGAT 


AGAGG.-AGAG 


CAAAACAAAA 


1121 


GTAAGAAAAA 


AGCACAGCAA 


GGAGCAGC7G 


ACACAGGACA. 


CAGCAATCAG 


GTCAGCCAAA 


ATTACCCTAT 


1191 


A.GTGCAGAAC 


» r*y*A* mamma 


AAA i sjGTACA 


TCAGGCCATA 


TCACCTAGAA 


CTTTAAACGA 


TAAGCTTGGG 


1261 


AGTTCCGCGT 


TACATAACiT 


ACGGTAAATG 


GCCCGCCTGG 


CTGACCGCCC 




AACGACCCCC 


GCCCATTGAC 


1331 


GTCAATAATO 


ACGTATGTTC 


CCATAGTAAC 


GCCAATAGGG 


AC77ICCATT 


GACGTCAATG 


GGTGGAGTAT 


1401 


TTACGGTAAA 


CTGCCCACTT 


GGCAGTACAT 


CAAGTGTATC 


ATATGCCAAG 


TACGCCCCCT ATTGACGTCA 


1471 


ATGACGGTAA 


ATGGCCCCCC 


TGGCATTATG 


CCCAGTACAT 


GACCTTATGG 


GAcrnuerA 


CTTGGCAGTA 


1541 


CATC7ACGTA 


TTAGTCATCG 


CTATTACCAT 


GGTGATGCGG 


TTTTGGCAGT 


ACATCAATGG 


GCGTGGATAG 


1611 


CGGTTTGACT 


CACGGGGA7T 


TCCAAGTCTC 


CACCCCATTG 


ACGTCAATGG 


GAG-iTitrnr 


TGGCACCAAA 


ieai 








ACTCCGCCCC 


ATTGACGCAA 


ATGGGCGGTA 




riTCAACGGGA 




TGTCGTAACA 


GGCGTGTACG 


1751 


GTGGGAGGTw 


TATATAAGCA 


GAGCTCGTIT 


AGTGAACCGT 


CAGATCGCCT 


GGAGACGCCA 


TCCACGCTGT 
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1821 


TTTGACCTCC 


ATAGAAGACA 


CCGACTTTAG 


AGgatccATC 


TAAGTAAGCT 


TGGCA.'.vCG 




1891 


TAAAATGGAA 


GACGCCAAAA 


ACATAAAGAA 


AGGCCCGGCS 


CCATTCTATC 


CTCTAGAGGA 


TGGAACCGCT 


1961 


GGAGAGCAAC 


TGCATAAGGC 


TATGAAGAGA 


TACGCCCTGG 


TTCCTGGAAC 


AA-".u>». 


ACAGATGCAC 


2031 


ATATCGAGGT 


GAACATCACG 


TACGCGGAAT 


ACTTCGAAAT 


GTCCGTTCGG 


TTGGCAGAAG 


CTATGAAACG 


2101 


ATATGGGCTC 


AATACAAATC 


ACAGAATCGT 


CGTATGCAGT GAAAACTCTrC 


^p^h^ * * ^^^^ "■ *! 


TATGCCGGTG 


2171 


TTGGGCGCGT 


TATTTATCGG 


AGTTGCAGTT 




ACGACATTTA 


TAATGAA w GT 


GAATTGCTCA 


2241 


ACAGTATGAA 


CATCTCGCAG 


CCTACCGTAG 




CAAAAAGGGG 


TTGCAAAAAA 


TTTTGAACGT 


2311 


Gwu-fcAA/tnnA 


TTACCAATAA 


T^CCAGAAAAT 


TATTATCATG 


GATTCTAAAA 


CGGA'i'iACCA 


GGGATTTCAG 


2381 


TCGATG7ACA 


CGTICGTCAC 


ATCTCATCTA 




TTAATGAATA 




CCAGAGTCCT 


CCTCCCGGTT 


CGAr:*r:x;7A 


2451 


TTGATCGTGA 


CAAAACAATT 


GCACTGATAA 


TGAATTCCTC 


TGGATCTACT 


GGGTTACCTA 


AGGGTGTGGC 


2521 


CCTTCCGCAT 


AGAACTGCCT 


GCGTCAGATT 


CTCGCATGCC 


AGAGATCCTA 


TTTTTGGCAA 


TCAAATCATT 


2591 


CCGGATACTG 


CGATTTTAAG 


TGTTGTTCCA 


TTCCATCACG 


GTTTTGGAAT 


GTT7ACTACA 


CTCGGATATT 


2661 


TGATATGTGG 


ATTTCGAGTC 


GTCTTAATGT 


ATAGATTTGA 


AGAAGAGCTG 


TTTTTACGAT 


CCCTTCAGGA 


2731 


1TACAAAATT 


CAAA3TGCGT 


TGCTAGTACC 


* * * l-l nil 


TCV7TC7TCG 


CCAAAAGCAC 


TCTGATTGAC 


2801 


^■—^ATjrt CunTT 


TATCTAA7TT 




Gn_V.'- .'jvjvjvj 


GCGCACCTCT 


TTCGAAAGA.-V 


GTCGGGGAAG 


2871 


CG 'ji i" 1 sjCAAA 


ACGCTTCCAT 


CTTCCAGGGA 


TACGA CAA G3 


ATATGGGCTC 




CATCAGCTAT 


2941 


TCTGATTACA 


CCCGAGGGGG 


AToATAAACC 


GGGrGCGGTC 


GGTAAAGTTG 


TrccAriYrr 


TGAAGCGAAG 


3011 


GTTGTGGATC 


TGGATACCGG 


GAAAACG CTG 


GGCG l'i AATC 


AGAGA 3GCGA 


AT7ATGTGTC 


AGAGGACCTA 


3081 


TGATTATGTC 


CGGTTAT3TA 


AACAATCCGG 


AAG C 3AC r AA 


CGCCTTGATT 


GACAAGGATG 


GATGGCTACA 


3151 


TTCTGGAGAC 


ATAGCTTACT 




CGAACACTTC 


TTCATAGTTG 


ACCGCTTGAA 


GTCTTTAATT 


3221 


nAATACAAAG 


G A * ATC AGGT 


GGCC "CCGCT 


Clal (3259) 

GAAT?GGAAT CGATATTGTT ACAACACCCC 


AACATCTTCG 


3291 


ACGCGGGCGT 


GGCAGGTCTT 


CCCGACGATG 


ACGCCGGTGA 


ACTTCCCGCC 


GCCGTTGTTG 


ttttggagca 


3361 


CGGAAAGACG 


ATGACGGAAA 


AAGnu^TCGT 


GGA ™ TACGTC 


GCCAGTCAAG 


TAACAACCGC 


GAAAAAGTTG 


3431 


CGCGGAGGAG 


TTGTGTTTGT 


GGACGAAGTA 


CCGAAAGGTC 


TTACCGGAAA 


ACTCGACGCA 


AGAAAAATCA 


3501 


GAGAGATCCT 


CATAAAGGCC 


AAGAAGGGCG 


GAAAG TCCAA 


Apal (3557) 
Xhol (3548) Kpnl ( 

ATTGTAACTC GAGGGGGGGC CCGGTACCTT 


3571 


TAAGACCAAT GACTTACAAG 


GCAGCIGTAG 


ATCTTAGCCA 


CTTTTTAAAA 


GAAAACGGGG 


GACTGGAAGG 


3641 




TCCwAAAGAA 


GACAAGATAT 


CCTTGATCTG 


TGGATCTACC 


ACACACAAGG 


CTACTTCCCT 
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3711 GMTGGCAGA ACTACACACC AGGGCCAGGG GTCAGATATC CACTGACCTT TGGATGGTGC TACAAGCTAG 



3781 


TACCAGTTGA 


GCCAGATAAG 


GTAGAAGAGG 


CCAATAAAGG 


AGAGAACACC 


AGCTKMTAC 


ACCCTGTGAG 


3851 


CCTGCATGGA 


ATGGATGACC 


CTGAGAGAGA AGTGTTAGAG 


TGGAGGTTTG ACAGCCCCCT 


AGCATTTCAT 


3921 


CACGTGGCCC 


GAGAGCTGCA 


TCCGGAGTAC 


TTCAAGAACT 


GCTGACATCG 


AGCTTGCTAC 


AAGGGACTTT 


3991 


CCGCTGGGGA 


CTTTCCAGGG 


AGGCCTGGCC 


TGGGCGGGAC 


TGGGGAGTGG 


CGAGCCCTCA 


GATGCTCCAT 


4061 


ATAAGCAGCT 


GCTTTTTGCC 


TGTACTGGGT 


CTCTCTGGTT AGACCAGATC 


TGAGCCTGGG 


AGCTCTCTGG 


4131 


CTAACTAGGG 


AACCCACTCC 


TTAAGCCTCA 


ATAAAGCTTG 


CCTTGAGTGC 


TTCAAGTAGT 


GTCTGCCCGT 


4201 




ACTCTGGTAA 


CTAGAGATCC 


CTCAGACCCr 


TTTAGTCAGT 




TCTAGCACCC 


CTGTTGTGTG 


GTGGAAAATC 



4271 CCCAGGAGGT AGAGGTTGCA GTGAGCCAAG ATCGCGCCAC TGCATTCCAG CCTGGGCAAG AAAACAAGAC 

4341 TCTCTAAAAT AATAATAATA AGTTAAGGGT ATTAAATATA TTTATACATG GAGGTCATAA AAATATATAT 

4411 AnTGGGCTG GGCGCAGTGG CTCACACCTG CGCCCGGCCC TTTOGGAGGC CGAGGCAGGT GGATCACCTG 

4481 AGTTTGGGAG TTCCAGACCA GCCTGACCAA CATGGAGAAA CCCCTTCTCT GTGTATTTTT AGTAGATTTT 

4551 ATTTTATGTG TATTTTATTC ACAGGTATTT CTGGAAAACT GAAACTGTTT TTCCTCTACT CTGATACCAC 

4621 AAGAATCATC AGCACAGAGG AAGACITCTG TGATCAAATG TGGTGGGAGA GGGAGGTTTT CACCAGCACA 

4691 TGAGCAGTCA GTICTGCCGC AGACTCGGCG GGTGTCCTTC GGTTCAGTTC CAACACCGCC TGCCTGGAGA 

4761 GAGGTCAGAC CACAGGGTGA GGGCTCAGTC CCCAAGACAT AAACACCCAA GACATAAACA CCCAACAGGT 

4831 CCACXCCGCC TGCTGCCCAG GCAGAGCCGA TTCACCAAGA CGGGAATTAG GATAGAGAAA GAGTAAGTCA 

4901 CACAGAGCCG GCTGTGCGGG AGAACGGAGT TCTATTATGA CTCAAATCAG TCTCCCCAAG CATTCGGGGA 

4971 TCAGAGTTTT TAAGGATAAC TTAGTGTGTA GGGGGCCAGT GAGTTGGAGA TGAAAGCGTA GGGAGTCGAA 

5041 GGTGTCCTTT TGCGCCGAGT CAGTTCCTGG GTGGGGGCCA CAAGATCGGA TGAGCCAGTT TATCAATCCG 

5111 GGGGTGCCAG CTGATCCATG GAGTGCAGGG TCTGCAAAAT ATCTCAAGCA CTGATTGATC TTAGGTTTTA 

5181 CAATAGTGAT GTTACCCCAG GAACAATTTG GCGAAGGTCA GAATCTTGTA GCCTGTAGCT GCATGACTCC 

5251 TAAACCATAA T7TCTWTT7 GTTTTTTTTT mTATTTTT GAGACAGGGT CTCACTCTGT CACCTAGGCT 

5321 GGAGTGCAGT GGTGCAATCA CAGCTCACTG CAGCCCCTAG AGCGGCCGCC ACCGCGGTGG AGCTCCAATT 

5391 CGCCCTATAG TGAGTCGTAT TACAATTCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG 

5461 CGTTACCCAA CTTAATCGCC TTGCAGCnCA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA AGAGGCCCGC 

5531 ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGCGCGA AATTGTAAAC GTTAATATTT 

5601 TGTTAAAATT CGCGTTAAAT TTTTGTTAAA TCAGCTCATT TTTTAACCAA TAGGCCGAAA TCGGCAAAAT 

5671 CCCTTATAAA TCAAAAGAAT AGACCGAGAT AGGGTTGAGT GTTGTTCCAG TTTGGAACAA GAGTCCACTA 

5741 TTAAAGAACG TGGACTCCAA CGTCAAAGGG CGAAAAACCG TCTATCAGGG CGATGGCCCA CTACGTGAAC 

5811 CATCACCCTA ATCAAGTTTT TTGGGGTCGA GGTGCCGTAA AGCACTAAAT CGGAACCCTA AAGGGAGCCC 

5881 CCGATTTAGA GCITGACGGG GAAAGCCGGC GAACGTGGCG AGAAAGGAAG GGAAGAAAGC GAAAGGAGCG 

5951 GGCGCTAGGG CGCTGGCAAG TGTAGCGGTC ACGCTGCGCG TAACCACCAC ACCCGCCGCG CTTAATGCGC 

5021 CGCTACAGGG CGCGTCCCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC 

6091 TAAATACATT CAAATATGTA TCCGCTCATC AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA 

5161 GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTITnT GCGGCATTIT GCCTTCCTGT 

6231 TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC 

6301 ATCGAACTGC ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA 

6371 GCACTTTTAA AGTICTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGCTOG 

6441 CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC 

6511 ATGACAGTAA GAGAATTATG CAG1GCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA 

6581 CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTITGCA CAACATGGGG GATCATGTAA CTCGCCTTGA 

6651 TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGOGTGACA CCACGATGCC TGTAGCAATG 

6721 GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTAdTA CTCTAGCTTC CCGGCAACAA TTAATAGACT 

6791 GGA TCGAGG C GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA 

6861 TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC 

6931 CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCIGAGA 

7001 TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT 

7071 AAAACTTCAT TTTTAATTtA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT 

7141 7AACGTGA GT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT 

7211 TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTOGTZT GTTTGCCGGA 

7281 TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT 
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7351 CTAGTCTAGC CGTAGTTAGG CCACCACTPC AAGAACTCTG TAGCACCGCC TACATACCIC GCTCTGCTAA 
7421 TCCTGTTACC ACTOGCTGCT GCCAGTGGCG ATAAGTCGT G TCTTACCGGG 7IGGACTCAA GACGATAGTT 
7491 ACCGGATAAG GCGCAGCGGT CGGGCT3AAC GQGGGGTTCG TGCACACAGC CCAGCTIGGA GCGAACGACC 
7561 TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG 
7631 ACAGGTATCC GGTAAGCGGC AGGGTCGGAA GAGGAGAGCS CACGAGGGAG CTTCCAGGGG GAAACGCCTG 
7701 GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG 
7771 GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCXTTIT TACGGTTCCT GGCC77TTGC TCGCCTTITG 
7841 CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA 
7911 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCCAATA 
7981 CGCAAACCGC CTCTCCCCGC GCGTTCGCCG ATTCATTAAT GCAGCTGGCA CGACAGGT7T CCCGACTCGA 
8051 AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTAGCT CACTCATTAG GCACCCCAGG CTTTACACTT 
8121 TATGCTTCCG GCTCGTATGT TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATCAC 
8191 CATGATTACG CCAAGCTCGG AATTAACCCT CACTAAAGGG AACAAAAGCT GCTGCAGGGT CCCTAACTCC 
8261 CAAGCCCCAC AGTGTGCCCT GAGGCTGCCC CTTCCTTCTA GCGGCTGCCC CCACTCGGCT TTGCTITCCC 
8331 TAGTTTCAGT TACTT GCGTT CAGCCAAGGT CTGAAACTAG GTGCGCACAG AGCGGTAAGA CTGCGAGAGA 
8401 AAGAGACCAG CTTTACAGGG GGTTTATCAC AGTGCACCCT GACAGTCGTC AGCCTCACAG GGGGTTTATC 
8471 ACATTGCACC CTGACA GTCG TCAGCCICAC AGGGGGTTTA TCACAGTGCA CCCTTACAAT CATTCCATTT 
8541 GATTCACAAT TTTTTTAGTC TCTACTGTGC CTAACTTGTA AGTTAAATTT GATCAGAGGT CTCTICCCAG 
8611 AGGGGAAAAC AGTATATACA GGGTTCAGTA CTATCGCATT TCAGGCCTCC ACCTGGGTCT TGGAATGTCT 
8681 CCCCCGAGGG GTGATGACTA CCTCAGTTGG ATCTCCACAG GTCACAGTGA CACAAGATAA CCAAGACACC 
8751 TCCCAAGGCT ACCACAATGG GCCGCCCTCC ACGTGCACAT GGCCGGAGGA ACTGCCATGT CGGAGGTGCA 
6821 AGCACACCTG CGCATCAGAG TCCTTGGTGT GGAGGGAGGG ACCAGCGCAG CTTCCAGCCA TCCACCTGAT 
8891 GAACAGAACC TAGGGAAAGC CCCAGTTCTA CTTACACCAG GAAAGGC 
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FIGURE 11 

1 7GGAAGGGCT AATTTSG7CC OttAAAAGAC AAGAGA TCCT TGATCTGTGG ATCTACCACA CACAAGGCTA 

71 CTTCCCTGAT TGGCAGAACT ACACACCAGG GCCAG GGATC AGATATCCAC TGACCTTTGG A7GGTGCTTC 

141 AAGTTAGTAC CAGTTGAACC AGAGCAAGTA GAAGAGGCCA AATAAGGAGA GAAGAACAGC TTGTTACACC 

211 CTATGAGCCA GCATGGGATG GAGGACCCGG AGGGAGAAGT ATTAGTGTGG AAGTTTGACA GCCTCCTAGC 

281 ATTTCGTCAC ATGGCCCGAG AGCTGCATCC GGAGTACTAC AAAG ACTGCT GACATCGAGC TTTCTACAAG 

351 GGACTTTCCG CTGGGGACTT TCCAGGGAGG TGTGGCCTGG GOGGGACT GG GGAGTGGCGA GCCCTCAGAT 

421 GCTACATATA AGCAGCTGCT TTITGCCTGT ACTGGGTCTC TCTGGTTAGA CCAGATCTGA GCCTCGGAGC 

491 TCTCIX3GCTA ACTAGGGAAC CCACTGCTTA AGCC TCAATA AAGCTTGCCT TGAGTG C TCA AAGTAGTGTG 

— I 

561 'KXCCGTCTG TTGTGTGACT CTGGTAACTA GAGATCCCTC AGACCCTTTT AGTCAGTGTG GAAAATCTCT 

631 AGCAGTGGCG CCCGAACAGG^G ACTTGAAAG CGAAAGTAAA GCCAGAGGAG ATCTCTCGAC GCAGGACTCG 

BssHII (711) 

701 GCTTGCTGAA G3333:acgcr caacaggcca cgggcggcgC cicATr^Gai ccrcaaaaa- — gacracc 

Clal (830) 

771 ggaggczaga aggagagagC 7ZG7?XZ?G .^C3?C^. TT.-^C^ a-a-.-.TT.-^. TCGATGGGAA 



841 AAAA.. W C-*. TAAGGCCAGG GGGAAAGAA3 AAGTACAAGC 7AAAGCAIA7 CGTA7CGGCA AGCAGGGAGC 



ci i 


7ASAAC3A77 


CGZAG77AA7 


C ZTGGCCTG7 


7AGAAACATC 


Accl (959) 

A3AASGITG? A3ACAAATAC TGGGACAGCT 


981 


ACAACCAPrr 




GA7CAGAGGA 


G 777CGA7CA 


CTATACAACA 


CAGTAGCAAC CCTCTATTGT 


1051 


GTGCACCAGC 


GGATCGAGA7 


CAAGGACACC 


AAGGAAGC7T 


TAGACAAGA7 


AGAGGAAGAG CAAAACAAGT 


1121 


*-*-«AuAAGAA 


G^jfCC CAOC AG 


G«nG0AG77G 




CAGCAA7CAG 


G7CAGCCAAA ATTACCCTAT 


1191 


noTGCAGAAC 


ATCCAGGGGC 


AAA7GGTAZA 


7CA&GCCA7A 


TCrtC CTAGrui 


C7TTAAACGA TAAGCTTGGG 


1261 


AGTTCCGCGT 


TACATAACTT 


ACGGTAAATG 


GCCCGCCTGG 


CTGACCGCCC 


AACGACCCCC GCCCATTGAC 


1331 


G7CAATAATG 


ACGTATGTTC 


CCATAGTAAC 


GCCAATAGGG 


ALTITCCATT 


GACGTCAATG GGTGGAGTAT 


1401 


ttacggtaaa 


CTGCCCACTT GGCAGTACAT CAAGTGTATC 


ATATGCCAAG 


TACGCCCCCT ATTGACGTCA 


1471 


ATGACGGTAA 


ATGGCCCGCC 


TGGCATTATG 


CCCAGTACAT 


GACCTTATGG 


GACTTTCCTA C'lTGGCAGTA 


1541 


CATC7ACGTA 


TTAGTCATCG 


CTA7TACCAT 


GCTGATGCGG 


TTTT5GCAGT 


ACATCAATGG GCGTGGATAG 
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1611 CGGTTrGACT CACGGGGATT ICCAAGTCTC CACCCCA TO ACGTCAATCG GAGTTXTTT TGGCACCAAA 

1681 ATCAACGGGA CTTTCCAAAA TGTCGTAAC A ACTCCGCCCC A73GACGCAA ATGGGCGG7A GGCGTGTACG 

1751 GTGGGAGG7C TATATAAGCA GAGCTCGTTT AG TGAACCGT CAGATCGCC7 GGAG=»CGCCA TCCACGCTC7 

1821 TTTGACCTCC ATAGAAGACA CCGACTCTAG AGgacccATC TAAGTAAGCT TGGCAT7CCG GTAC^GTTGG 
^ 

1891 TAAAATCGAA GACGCCAAAA ACATAAAGAA AGGCCCGGCG CC^TTCTATC CTC7AGAGGA TGGAACCGCT 
1961 GGAGAGCAAC TGCATAAGGC TATGAAGAGA 7ACGCCCTGG TTCCTGGAAC AATTGC7TTT ACAGATGGAC 
2031 ATATCGAGGT GAACATCACG TACGCGGAAT ACTTCGAAAT GTCCGTTCGG TTGGCAGAAG CTATGAAACG 
2101 ATATGGGCTG AATACAAATC ACAGAATCGT CGTATGCA GT GAAAACTCTC TTCAATTCTT TATGCCGGTC 
2171 TTGGGCGCGT TATZTATCGG AGTTGCAGTT GCGCCCGCGA ACGACATTTA TAAtGAACGT GAATTGCTCA 
2241 ACAGTATGAA CATTTCGCAG CCTACCGTAG TO ' : , rG : , : , lV CAAAAAGGGG 7TGCA-A.AAA TTTTGAACGT 
2311 GCAAAAAAAA TTACCAATAA TCCAGAAAAT TATTATCATG GATTCTAAAA CGGAT7ACCA GGGATTTCAG 
2381 TCGATGTACA CGTTCGTCAC ATCTCATCTA CCTCCCGGTT 1TAATGAATA CGATTTTGTA CCAGAGTCCT 
2451 TTGATCGTGA CAAAACAATT GCAC7CATAA TGAATTCCTC TGGATCTACT GGGTTACCTA AGGGTGTGGC 
2521 CCTTCCGCAT AGAACTGCCT GCGTCAGA7T CTC GCATGCC AGAGATCCTA TTTTrGGCAA TCAAATCATT 
2591 GCGGATACTG CGATTTTAAG TGTT GTTCCA TTCCATCACG GTTTTGG.AAT GTTTACTACA CTCGGATATT 
2651 TCATA7GTGG ATTTCGAGTC GTCT TA-.TGT ATAGATTTGA AGAAGAGCTG TTTTTArGAT CCCTTCAGGA 
2731 T**ACAAAATT CAAAGTGCGT TGCTAGTACC AACCCTATTT TCATTCTTCG CCAAAAGCAC TCTGATTGAC 
2801 ^AATACGATT TATCTAATTT ACACGAAATT GC7TCTCGGG GCGCACCTCT TTCGAAAGAA GTCGGGGAAG 
2871 CGG7TGCAAA ACGCTTCCAT CTTCC AGGGA 7AC3AIAAGG ATATGG3CTC ACTGAGACTA CATCAGCTAT 
2941 TCTGATTACA CCCGAGGGGG ATGATAAAC C G3CC3CG37C GG7AAAGTTG T7CCA7TTTT TGAAGCGAAG 
30U CTGTGGATC TGGATACCGG GAAAACGCTC GGCG77AA7C AGAGAGGCGA ATTATGTGTC AGAGGACCTA 
3081 TGATTATGTC CGGTTATGTA AACAA7CCGG AAGCGACCAA CGCC7TGATT GACAAGGATG GATGGCTACA 
315 1 ^CTGGAGAC ATAGCTTACT GGGACGAAG A CGAAGACT7C TTCA7AGTTG ACCGC7TGAA GTCTTTAATT 

Clal (3259) 

3221 AAATACAAAG GATATCAGGT G GCCCCCGCT GAATTGGAAT CGATATTGTT AGAACACCCC AACATCTTCG 
3291 "-GCGGGCGT GGCAGGTCTT CCCGAGGATG ACGCCG 3TGA ACTTCCCGCC GCCGTTGTTG TTTTGGAGCA 
3351 CGGAAAGACG ATGACGGAAA AAGAGA TCGT GGA7TACGTC GCCAGTCAAG TAACAACCGC GAAAAAGTTG 
3431 CGCGGAGGAC TTGTGTrrGT GGACGAAGTA CCGAAAGGTC T7ACCGGAAA AC7C3ACGCA AGAAAAATCA 
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Apal (3557) 
Xhof (3548) Kpnl (3563) 

3501 GAGAGATCCT CATAAAGGCC AAGAAGGCCG GAAAGTCCAA ATTGTAAcTC GAGGGGGGGC CCGGTACCTT 

. 

3571 TAAGACCAAT GACTTACAAG GCAGCTGTAG ATCTTAGCCA CTTTTTAAAA GAAAAGGGGG GACTGGAAGG 



3641 GCTAATTCAC TCCCAAAGAA GACAAGATAT CCTTGATCTG TGGATCTACC ACACACAAGG CTACTTCCCT 

3711 GATIGGCAGA ACTACACACC AGGGCCAGGG GTCAGATATC CACTGACCTT TGGATGGTGC TACAAGCTAG 

3781 TACCAGTTGA GCCAGATAAG GTAGAAGAGG CCAATAAAGG AGAGAACACC AGCTTGTTAC ACCCTGTGAG 

3851 CCTGCATGGA ATGGATGACC CTGAGAGAGA AGTGTTAGAG TGGAGGTTTG ACAGCCGCCT AGCATTTCAT 

3921 CACGTGGCCC GAGAGCTGCA TCCGGAGTAC TTCAAGAACT GCTGACATCG AGCTTGCTAC AAGGGACTTT 

3991 CCGCTGGGGA CTTTCCAGGG AGGCGTGGCC TGGGCGGGAC TGGGGAGTGG CGAGCCCTCA GATGCTGCAT 

4061 ATAAGCAGCT GCTTTTTGCC TGTACTGGGT CTCTCTGGTT AGACCAGATC TGAGCCTGGG AGCTCTCTGG 
» 

4131 CTAACTAGGG AACCCACTGC TTAAGCCTCA ATAAAGCTTG CCTTGAGTGC TTCAAGTAGT GTGTGCCCGT 


4201 CTGTTGTGTG ACTCTGGTAA C7AGAGATCC CTCAGACCCT TTTAGTCAGT G7GGAAAATC TCTAGCACCC 


4271 CCCAGGAGGT AGAGGTTGCA GTGAGCCAAG ATCGCGCCAC TGCATTCCAG CCTGGGCAAG AAAACAAGAC 

4341 TGTCTAAAAT AATAATAATA AGTTAAGGGT ATTAAATATA TTTATACATG GAGGTCATAA AAATATATAT 

4411 ATTTGGGCTG GGCGCAGTGG CTCACACCTG CGCCCGGCCC TTTGGGAGGC CGAGGCAGGT GGATCACCTG 

4481 AGTTTGGGAG TTCCAGACCA GCCTGACCAA CATGGAGAAA CCCCTTCTCT GTGTATTTTT AGTAGATTTT 

4551 ATTTTATGTG TATTTTATIC ACAGGTATTT CTGGAAAACT GAAACTGTTT TTCCTCTACT CTGATACCAC 

4621 AAGAATCATC AGCACAGAGG AAGACTTCTG TGATCAAATG TGGTGGGAGA GGGAGGTTTT CACCAGCACA 

4691 TGAGCAGTCA GTTCTGCCGC AGACTCGGCG GGTGTCCTTC GGTTCAGTTC CAACACCGCC TGCCTGGAGA 

4761 GAGGTCAGAC CACAGGGTGA GGGCTCAGTC CCCAAGACAT AAACACCCAA GACATAAACA CCCAACAGGT 

4831 CCACCCCGCC TGCTGCCCAG GCAGAGCCGA TTCACCAAGA CGGGAATTAG GATAGAGAAA GAGTAAGTCA 

4901 CACAGAGCCG GCTGTGCGGG AGAACGGAGT TCTATTATGA CTCAAATCAG TCTCCCCAAG CATTCGGGGA 

4971 TCAGAGTTTT TAAGGATAAC TTAGTGTGTA GGGGGCCAGT GAGTTGGAGA TGAAAGCGTA GGGAGTCGAA 

5041 GGTGTCCTTT TGCGCCGAGT CAGTTCCTGG GTGGGGGCCA CAAGATCGGA TGAGCCAGTT TATCAATCCG 

5111 GGGGTGCCAG CTGATCCATG GAGTGCAGGG TCTGCAAAAT ATCTCAAGCA CTGATTGATC TTAGGTTITA 

5181 CAATAGTGAT GTTACCCCAG GAACAATTTG GGGAAGGTCA GAATCTTGTA GCCTGTAGCT GCATGACTCC 

5251 TAAACCATAA TTTCTITTTr GTTTTTTTTT TTTTATTTTT GAGACAGGGT CTCACTCTGT CACCTAGGCT 

5321 GGAGTGCAGT GGTGCAATCA CAGCTCACTG CAGCCCCTAG AGCGGCCGCC ACCGCGGTGG AGCTCCAATT 

5391 CGCCCTATAG TGAGTCGTAT TACAATTCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG 

5461 CGTTACCCAA CTTAATCGCC TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA AGAGGCCCGC 

5531 ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGCGCGA AATTGTAAAC GTTAATATTT 

5601 TGTTAAAATT CGCGTTAAAT TTTTGTTAAA TCAGCTCATT TTTTAACCAA TAGGCCGAAA TCGGCAAAAT 

5671 CCCTTATAAA TCAAAAGAAT AGACCGAGAT AGGGTTGAGT GTTGTTCCAG TTTGGAACAA GAGTCCACTA 

5741 TTAAAGAACG TGGACTCCAA CGTCAAAGGG CGAAAAACCG TCTATCAGGG CGATGGCCCA CTACGTGAAC 

5811 CATCACCCTA ATCAAGTTIT TTGGGGTCGA GGTGCCGTAA AGCACTAAAT CGGAACCCTA AAGGGAGCCC 

5881 CCGATTTAGA GCTTGACGGG GAAAGCCGGC GAACGTGGCG AGAAAGGAAG GGAAGAAAGC GAAAGGAGCG 

5951 GGCGCTAGGG CGCTGGCAAG TGTAGCGGTC ACGCTGCGCG TAACCACCAC ACCCGCCGCG CTTAATGCGC 

6021 CGCTACAGGG CGCGTCCCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTITTC 

6091 TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA 

6161 GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTITTT GCGGCATTIT GCCTTCCTGT 

6231 TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC 

6301 ATC GAACT GG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTIT CCAATGATGA 

6371 GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG 

6441 CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC 

6511 ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA 

6581 CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA 

6651 TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG 
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6721 GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT 
6791 GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTOGCTOGT TTATTGCTGA. 
6861 TAAATCTGGA GCCGGTGAGC CTGGCTCTCG CGGTATCA7T GCAGCACTGG GGCCAGATGG TAAGCCCTCC 
6931 CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA 
7001 TAGGTGCCTC ACTG ATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT 
7071 AAAAOTCAT TnTAATTTA AAAGGATCTA GGTGAAGATC CTmTGATA ATCTCATGAC CAAAATCCCT 
7141 TAACGTCAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT 
7211 TTTTTCTGCG OGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA 
7281 TCAAGAGCTA CCAACTCTIT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT 
7351 CTAGTGTAGC CGTAGTTAGG CCACCACTIC AAGAACTCTG TAGCACCGCC TACATACCIC GCTCTGCTAA 
7421 TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT 
7491 ACCGGATAAG GCGCAGCGGT CGGGCTGAAC CGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC 
7561 TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG 
7631 ACA GGTAT CC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG 
7701 GTATCTTTAT AGTCCT2TCG GGTITCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG 
7771 GGGCGGAGCC TATGG AAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG 
7841 CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAAOCGTATT ACCGCCTTO AGTGAGCTGA 
7911 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCCAATA 
7981 CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA CGACAGGTTT CCCGACTGGA 
8051 AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTAGCT CACTCATTAG GCACCCCAGG CTTTACACTT 
8121 TAT3CTTCCG GCTCG7ATGT T3TCTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC 
8191 CATGATTACG CCAAGCTCGG AATTAACCCT CACTAAAGGG AACAAAAGCT GCTGCAGGGT CCCTAACIGC 
8261 CAAGCCCCAC AGTGTGCCCT GAGGCTGCCC CTTCCT7CTA GCGGCTGCCC CCACTCGGCT TTGCTTTCCC 
8331 TAGTTTCAGT TACTTGCGTT CAGCCAAGGT CTGAAACTAG GTGCGCACAG AGCGGTAAGA CTCCGAGAGA 
8401 AAGAGACCAG CTTTACAGGG GGTTTATCAC AGTGCACCCT GACAGTCGTC AGCCTCACAG GGGGTTTATC 
8471 ACATTGCACC CTGACAGTCG TCAGCCTCAC AGGGGGTTTA TCACAGTGCA CCCTTACAAT CATTCCATTT 
8541 GATTCACAAT TTTTTTAGTC TCTACTGTGC CTAACTTGTA AGTTAAATTT GATCAGAGGT GTGTTCCCAG 
3611 AGGGGAAAAC AGTATA7ACA GGGTTCAGTA CTATCGCATT TCAGGCCTCC ACCTGGGTCT TGGAATGTGT 
S681 CCCCCGAGGG GTGATGACTA CCTCAGTTGG ATCTCCACAG GTCACAGTGA CACAAGATAA CCAAGACACC 
S751 TCCCAAGGCT ACCACAATGG GCCGCCCTCC ACGTGCACAT GGCCG3AG3A ACTGCCATGT CGGAGGTGCA 
6821 AGCACACCTG CGCATCAGAG TCCTTG3TGT GGAGGGAG3G ACCAGCGCAG CTTCCAGCCA TCCACCTGAT 
S831 GAACAGAACC TAGGGAAAGC CCCAGTTCTA CTTACACCAG GAAAGGC 
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FIGURE 12 
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FIGURE 13 



Gag production from the Rev-lndopendent gag-pol HIV* 
1 vector pCMVBNkan 
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FIGURE 14 



Reverse transcriptase activity from the Rev- i 
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FIGURE 15A 
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FIGURE 15B 



Transduction of lymphoid cells (Jurkat) 
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FIGURE 15C 
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FIGURE 15D 
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FIGURE 16 
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FIGURE 17 

1 CCTGGCCA1T GCATACGTTG tatccatatc ataatatgta catttatatp goctcatctc caacattooc 

71 GCCATGTTGA CATTGATTAT TGACTAGTTTA TTAAT?£TAA TCAATPACGG GOTCATIACT TCATAGOXA 

141 TATATGGAGT TCCGCCTTAO ATAACCTACG CTTAAATUGOC COOCIXJOC1X3 AOfXXXCAAC GACrXKXJQOC 

211 CA3TGACOTC AATAAT3ACG TATOTTCCCA TAGTAACGOC AATAGGGACT TO^\TO3AC GTCAATO0L7T 

281 GQAOTATTTA CGGTAAACTG CCC&CITOGC AOTACATCAA GiOTATCATA TO2CAA0TAC GCCCCCrA.TT 

351 GAGGTCAATG ACGGTAAATC GCCCGCCPGG CXTTATGCOC AGlftCA r iK3AC CMTATOGGAC TTTCXTTACTT 

421 GGCACTACAT CTACOTATTA OTCATCXXTTA TTAOCATOCyT GA.TOXGTTT TOGCAGKACA TCAATGGQCG 

491 TGGATAGCGG TTTGACTCAC GGGGATTPCC AACTPCTCCAC CCCATPGACX3 TCAATOGOAG TTTOPTTTOO 

561 GACCAAAATC AACGGGACTT TCCAAAATGT CTTAACAACT G:GCCCCATT GACGCAAATO GGCGGTAGGC 

631 GfTOi , ACGCnV2 CHAGCnCTAT ATAAGCAGAG CTCX7JTrAj:,T GAACCCTCAG ATCQCCICGA GACGCCATDC 

BssHII (762) 
Sacll (746) 

701 ACGCTOHTT CffiCCrOCATA GAAGACACCG GGAOCGATCC AGCCTCC'GCG GC3C3CGCTM GTATO30ATG 
i i n " i t nn ■ -M i h i m i r 1> M G C 

771 TC1TGGGAAT CAGCTGC1TA ircXXATCTT CCTTTTAACT SiVTATGGGA TCTATr^PAC TCTATA'IXTFC 
3> L G N QLL I A I U L L S VYG IYCT L V V 

841 ACAQTCITTT ATQGTOTACC AGCFTOGftGG AATOCGACAA TTCCCCTC7T TTCTGCAACC AAGAATAGGG 
27> T V F Y G V P A W R NAT IPLF CAT K N R 

911 ATACTTGGGG AACAACTCAG TGCCTACCAG ATAATGGTOA TTATOGAGAA GTCGCCCrTA ATCTTftCAGA 
5G»D TWG T.TQ CLP DNGD YSE V A L N V T E 

981 AAGCT7TCAT .'JCCTV3GAATA ATACAC7TCAC AGAACAOGCA ATAGAGGATG TATOGCAACT OTTTCAGACt.: 
73* 9 F D A W N NTYT E Q A IEO .VWQL FET 
1051 TCAATOAAGC CITGTGTAAA ATTATCCCCA T7ATGCATTA CTATG&GATG C\M7*AAAG'T GAGACAGATA 

97* S I' K PCVK L8P L C i* TMRG N KS ETD 
1121 GATGGGGATT GACAAAATCA ATAACAACAA CAGCATCAAC AACA2BCAAGG ACAGCA1CAG C?iAAAGTAGA 

i20>- RWGL TKS ITT TA8T TS'T TAS AKVD 
1191 C7tf"GGTCAAT GAGACTAGrTT OTVyrATAGC CCAGCiATAAT TGCACAGGCT TO3AACAAGA GCAAATGATA 

143> M VN ETS SCIA QDN CTG LEQE G M i 
1261 AGCTOTAAAT TCAACATOAC AGOOTCAAAA AGAGACAAGA AAAAAGAGTA CAATGAAACT TGGTACTCTG 

167* SCK FNMT GLK ROK K KEY NET WYS 
1331 CAGATTTGOT A'KJTCAACAA GGGAATAACA CTGC3TAATGA AAGTAGATCT TACATCAACC ACTCTAACAC 

190>A 0 L V OEQ 3 N N TGNE SRC Y M N HCNT 
1401 TTCTGTTATC CAAGAGrCTT CTGACAAACA TTATTOGGAT GCTATPAGAT TTAGGTATTG TOCACCTOCA 

213* S V I QE3 CDKH YWD AIR FRYC APP 
1471 GGTTATGCTT TTJCTTAGATG TAATCACACA AATTATPCAG GCTPTATGCC TAAATOTrCP AAGCJTGGTCG 

237^ GYA LLRC NDT NYS GFMP KGS KVV 
1541 TCTCnCATC CACAAOGATG ATOGAGACAC AQACTTCTAC ITGCiTTCCC TTFAA'PCXSAA CTAGAGCAGA 

^►V SSC TRM MET QTST.WFG FNG TRAE 
1611 AAATAOAACT TOPATTPACT GQCATGGTAG GGATAATAfiG ACTATAATTA CTTT7AATAA GTATPATAAT 

283>- NRT Y!Y W HG R DNR TI I SLNK YYN 
1681 CHAACAATGA AATGTA?3AAG ACCAGGAAAT AAGACAGTTr TACCAGTCAC CA'ITATGTCP GCMTGffPPP 

307^ LTM KCRR PGN KTV LPVT IMS GLV 
1751 TCCACTCACA ACCAATCAAT GA1CAGGCCAA AGCAGGCATG GTOTTOGTTT GGAGGAAAA? GG«AG(aA!PX 

330^F MSG PI N DRP KQAW CWF GGK WKDA 
1821 AATAAAAGAG GTGAAGCAGA CCATTGTCAA ACA1COCAGG TA'EACPGGAA CIAACAATAC TGATAAAATC 

353^ IKE VKQ TIVK HPR YTG T N N T D K I 
1891 AA1TTCACGG CPCCTOGAO^ AGGAGATCCG GAACfTT7%CCn' TCATCfKXJAC AAATTGCAGA GGAGAOTIXIC 

377^ N LT APGG GDP EVT FWWT NCR GEF 
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1961 TCTACTGTTAA AATGAATTGG TTTCTAAATT G3GTAGAAGA TAGGAATACA GCPAACCAGA AOCCAAAGGA 

400>L Y CK MNW FLN W V E D P. NT ANQ K?K£ 
2031 ACAGCATOAA AQGAATTACG TGCCATOTCA TAFTAGACAA ATAATCAAGA CTTOGCATAA AGTO3GCAAA 

423» Q HK RNY VPCH I RQ I I N TWHK VQK 
2101 AATGOTTATT TOXTOCAAG AGiV^GAGAC CTCACCTGTA AClCf^CAGT t3AOCAGTCTC ATAGCAAACA 

447* N V Y LPPR-SGO LTC NSTV T S L IAN 
2171 TAGATTGGAT TGATGGAAAC CAAACTAATA IX^OCATCAG TGCAGAGGTG GCAGAACTGT ATCGATTGGA 

47QH OWI DON G T N ITMSAEVA.EL Y R L E 
2241 ATTGGGAGAT TATAAATTAG TAGAGATCAC TCCAA'tTCGC TTGGaXJCCA CAGATGTCSAA GAGOTACfiCT 

493> LQ D YKL V E I T PI Q LAP TDVK RYT 
2311 ACTQGTGGCA CCTCAAGAAA TAAAAGAGGG GTCTTrCTTGC TAGCGrPCTT 1 i^mTCTC CUZAACGGCAC 

517> T G G TSRN KRQ V F V L G F L Q F L , A T A 
2381 GTTCTCCAAT GGGAGCCGCC AGCCTGACCC TCAOSGCACA CTCCOGAAOT TTATO3GCTC GGMAGTCCA 

540>G SAM G A A 3LT LTAQ SRT LLA GfVQ 
2451 ACAGCAGCAA CAGCTC7TTGG ACX7TGC7TCAA GAGACAACAA GAATJ^TTGC C1ACTGA<TCOT CTCGGOAACK 

563* QQQ QLL OVVK RQQ ELL RLT V WGT 
2521 AAGAACCTCC AGACTAGGCT CACTGCCATC GAGAACTACT TAAAGGACGA CKXGCAGCTG AATCCTH3GG 

587> KNL QTRV T A I E K Y LKOQAQL NAW 
2591 GATGTGCGTT T?£2ACAAOTC TGCCACACTA CTOKiOCATG GCCAAATCCA AGPOTAACAC C&AAGTCGAA 

610*G CAF ft Q V GHT TVPW PNA SLT PKWN 
2661 CAATGAGACT 'TOGCAAGACT GGGAGCGAAA GGTTGACiTC TfGGAAGAAA A'FATAACAGC CC'rcCTAGAfi 

633» NET WQE WERK VDF LEE NITA LLE 
2731 GAGGCACAAA TTCAACAAGA GAAGAACATC TAlCAATlTvC AAAAOTTGAA TAGCTGGGAT GTCTTTGGCA 

657* EAQ IGQE KNM YEL QKLN SWD VFG 
2801 AlTQGn^TGA CCTTGCPTCl* TGGATAAAGT ATATACAATA TCGAGnTITAT ATACTTTKaTAG GAGTAATACT 

680^N WF0 LAS WIK YIQY GVY I V V G V I L 
2871 GTTAAGAATA GTGATCPATA TAOTACAAAT GCTAGCTAAG TTAAGGCAGG <3CTATAGGCC A^IGTOTICT 

703> LRI V ! Y IVQM LAK LRQ GYRP VFS 
2941 TCCCCACCCT CVFATTTCCh GC^GAOCCAT A7CCAAGA33 ACCCGGCACT GOCAACCAGA GAAGGCAAAC 

727> SPP SYFQQTH I Q Q OPAL PTR EGK 
3011 AAAGAGACOG TO3AGAAGGC GGTGGCAAOA GCTOOTGGOC 1TOGOAGATA GAATATATCe ACHnCrTA? . 

750»E ROG GEG GGN SSWPWQi EYI HFLI 
3081 TCGTCAGCTT ATTAGACTCT TCACTTGGCT ATTCAC7TAAC TCTAGGACTT TQCTATOIAG AOTATACCAG 

773> RQL i R L LTWL rSN CRT LLSR VYQ 
3151 ATCCTCCAAC CAATACTCCA GAGGCTCICT GCGACXXITAC AGAGGAII^CG AGAAGTOCTC AGGACTGAAC 

797* f L Q PILQ RLS A T L QRIR EVL RTE 
3221 TGACCTACCT ACAATATGGG TGGAGCTATT TCCATGAGQC GGTCCAGGCC GTCTGGAGAT CTGGCGACCG 

820K TYL QYG WSY FHEA V Q A VWR SGDR 
3291 ACAOAGACTC TTC5CGGGCGC C&V3GGGAGAC TTATGGGAGA CTCTTAGAGA CTCTTGOGGG CGCGTGGGGA 

843* QRL L R A RGET YGR LLE TLAG AW-<S 
3361 OvCTTATGGG AGACTCIM'AjG GAGAGGAGAG GTGGAAGATG GATACTCGAG '3TGGAAGATC GAI^CTCGCC 

S67^.D L W ETLR RGE VED GYSR WKM DTR 
3431 AAATCCATCC OClAG»3AGGAT TAGACAAGG*3 CTK^AGCTCA CTCrCTTGTG AfXAGGKJGA TTAGACAAGG 

890^Q I HP Q E D 

BamHI (3539) 

3501 GCTTGAGCTC ACTCTCTIOT GAGGGACAGA GGGACACiAqg atccaccagt ti;D3gaCTCG AG3GGGGGCC 
3571 OGGTACGAGC GCTTAGCT?^ CT^AGACCA CCTOCCCTGC GAGCTAAGCT GGACAGCCAA TGACGGGTAA 

3641 GAGAGTGACA TTTTTCAC'rA ACCTAAGACA GGAGGGCCGT CAGAQCTACT GOCTAATCCA AAGACGGCTA 



3711 7AAGTCATAA AAATGTATCA CTCCAACCTA AGACAGGCGC AGCTICCGAG GGATTrGTCG TCTCTTTTAT 
3781 ATATATTTAA AAGGGTGACC TGTCCGGAGC CG TGCTCCOC GGATGATGTC TTGGTCTAGA CTCGRGGGGG 
3851 GGCCCGGTAC GATCCACATC TCCTCTGCCT TCTAGTIK^XV AGCC^TCTGT 'V&VTTWX'.CC TOXCCGTOC 



3921 CrrCCTTGAC tXTGGAAGCT -3CCACTCCX' A . <7Ita' , rCCXTIC CAAimW GAOSAAATTG CAltXCASTO 
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3991 tctc»otwm TOKarrcra 'trcaxssaoM tcogxtjoxig vjk<£J€xx:& Aa&woAGGA rasK&Acae 

Kpnl (4097) 

4061 AATOOCAOGC ATGCPGGCX?A TSCGGTGGGC TOTAttMJIA CCC^OQTGCrP (3AAGAA1TGA CCTX5GTTCCT 

4131 gctgugcc&i? aaa^agcao oowattxcc wirororGA cacactotop- ccAexmxv ckthottagt 
4201 raMaxcA ctcataggal? actcatacscx 1 cacxwoxxt ccgcxtpcaa 'ircr.^ccGc 'i^AAai^crr 



4271 OCAiXOOTCT CTQOCWXXX CATC^CCCA CCWVCCAAA CCTAjSCCTCC AMftfflOXSA MAAATTAAA 

4341 CCAAGSWU3G CttATTAAflTC GAGAGCX&fiA GAAAA'POCCT CCAAOTOTS A33AAGTAAT GAOftiAAATc: 

4411 A'JEAGAAOTTC TTCCGCTTCC TCGCTCACTG ACTOGCTGCG CTCGGTCGTT CGGCTGCGGC GAQCGGTATC 

4481 AGCTCACTCA AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG CAGGAAAGAA CAOTKSAQCA 
4551 AAAGGCCAGC AAAAGGCCAG GAACCCTAAA AAGGCCGCGT TGCTQGCGTT TTTOCATAGG CTCCGCCCCC 
4621 CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATAOCA 
4691 GGCGTTFCCC CCTGGAAGCT CCCTCGTGCG CTCTCCPGMT CCGAGCCTGC GGCTTAOCGG ATACCTGTCC 
4761 GCCTTOCTCC CTTCGGGAAG CCTGGCGCTT TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG 
4831 TCGTTCGCTC CAAGCTOGGC TCTCTGCACG AACOCCCCCT TCAGCCOGAC OGCTGCQCCT TOTCCGCTAA 
4901 CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACITATCG CCACTGGCAG CAGCCACTGG TAACAGGATr 
4971 AGCAGAGCGA GGTATGTPAGG CGGTCCTACA GAGTTCTTGA AGTGGTGGCC TAACTAOGGC TACACTAGAA 
5041 GGACAffEATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA AGAGTTQGTA GCTCTTGATC 
5111 CGGCAAACAA ACCACCGCTG GTAGCGGTGG TlTlTTOr TGCAAGCAGC AGATTACGCG CAGAAAAAAA 
5181 GGATCTCAAG AAGATCCITT GATCTTTTCT ACGGGCTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG 
5251 GGATTTTGGT CATGAGATTA TCAAAAAGGA TCTICACCIA GATCCTTTTA AATTAAAAAT GAAGTTTIAA 
5321 A1CAATCTAA AGTATATATG AGTAAACTTG GTCTCACAGT TACCAATCCT TAATCAGTGA GGCACCTATC 
5391 TCAGCGATCT CTCTATPTCG TTCATCCATA GTTGCCTGAC TCCGGGGGGG GGGGGCQCTG AGGTCTGCCT 
5461 CGTGAAGAAG (HX3TTGCTGA CTCATACCAG GCCTGAATCG CCCCATCATC CAGCCAGAAA GTGAGGGAGC 
5531 CACGCTTGAT GAGAGCITTG TTCTAGGTGG ACCAGTFGOT GATTTTGAAC TTTTGCTITG CCACGGAACG 
5601 GTCTGCGTTG TCGGGAAGAT GCCTGATCTG ATCCTTCAAC TCAGCAAAAG TTCGATTTAT TCAACAAAGC 
5671 CGCCCTCCCG TCAAGTCAGC GTAATGCTCT GCCAGTGTTA CAACCAATTA AOCAATTCTG ATPAGAAAAA 

271< F F 

5741 CTCATCGAGC ATCAAATGAA ACTGCAATTT ATTCATATCA GGATTATCAA TACCATATTT TTGAAAAAGC 

269« E D L M L H F QLK NMD PNDI QYK QFL 
5811 CGTTTCTGTA ATGAAGGAGA AAACTCACCG AGGCAGTTOC ATAGGATGGC AAGATCCTGG TATCGGTCTG 

245* R KQL BPS F E G LCNW L I A LD.Q YRDA 
5881 CGATTCCGAC TCGTCCAACA TCAATACAAC CTATTAATTT CCCCTCGTCA AAAATAAGGT TATCAAGTCa 

222* IQ.V RGV D 1 C G ILK QED FILN DLS 
5951 GAAATCACCA TGAGTGACGA CTGAATCCGG TGAGAATGGC AAAAGCTTAT GCATTTCTTT CCAGACTT3T 

199* FDQ HTVV SO. P SFP L L K H * M E K WVQ 
6021 TCAACAGGCC AGCCATTACG CTCGTCATCA AAATCACTCG CATCAACCAA ACCGTTA3TC ATTCGTGATT 

175<E V P W GNR EDO FDSA DVL GNN MRSQ 
6091 ajGCCTGAGC GAGACGAAAT ACGCGATCGC TGTFAAAAGG ACAATTACAA ACAGGAATCG AATCCAACCG 

152< A Q A LRF VRDS NF P CNC VPIS HLR 
6161 GCGCAGGAAC ACTGCCAGCG CATCAACAAT ATT1TCACCT GAATCAGGAT ATTCTTCTAA TAOCTGGAAT 

129* RLF V A L A DVI N E G SDPY EEL VQF 
6231 Gcromrcc oqgggatcgc AGTGGTGAGT AACCATGCAT CATCAGGAGT aoggataaaa tgcttgatcg 

105<A TKG PIA TTL LWAD DPT RIF HKIT 
6301 TCGGAAGAGG cmaaattoc gtcagccagt ttactctgac catctcatct gtaacatcat tggcaacgct 
. 82« P L P MFE TL WN LRV MED TVDN A V S 
6371 ACCTTTGGCA TGTTICAGAA ACAACTCTGG CGCATOGGGC TTCCCATACA ATCGATAGAT TGTCGCACCT 

594 G K G HKLF L EP A DP KGYL R Y I TAG 
6441 GATTOCCCGA CATTATCGCX3 AGCCCATTTA TACOCATATA AATCAGCATC CATCTTGGAA TPPAATCQC*.i 
351S QGV NDR A W K YGYL DAD M N S N L R P 
6511 GCCTCGAGCA AGACGTTTCC CGTPGAAIAT GGCTCATAAC ACCCCTTCTA TTACTGTTTA TGTAAGCAGA 

12<RSC STE RQIH SM 
6581 CAGTTTEATT GTTTCATGATG ATKPAnTTT ATCTIX3TGCA ATGTAACATC AGAGATTTTG AGACACAACG 
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6651 TGGOTTCCC CCCCCCCCCA TTATFGAAGC ATTTATCAGG GTTATTCTCT CATGAGCCGA TACATATTTG 

6721 AATGTATTTA GAAAAATAAA CAAATAGGGG TTCCGCGCAC ATTTCCCCGA AAAGTGCCAC CTGACGTCTA 

6791 AGAAACCATP ATTATCATGA CATTAACCTA TAAAAATAGG CGTATCACGA GGCCCTTTCG TCTCGCGCGT 

6861 TTCGOTGATG ACGGTGAAAA CCTCTGACAC ATGCAGCTOC CGGAGAOGGT CACAGCTTGT CTCTAAGCGG 

6931 ATGCCGQGAG CAGACAAGCC CGTCAGGGCG CGTCAGCGGG TGTTGGCGGG TGTCGGGGCT GGCTTAACTA 

70O1 TGCGGCATCA GAGCAGATTG TACTGAGAGT GCAOCATATG CGGTOTGAAA TACCGCACAG ATGCGTAACX5 

7071 AGAAAATACC GCATCAGATT GGCTATTGG 



